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L7 1 L6 AND ( (L4 OR L5) ) 

=> D CBIB ABS 

L7 ANSWER 1 OF 1 CAPLUS COPYRIGHT 2002 ACS 

1987:170241 Document No. 106:170241 Bacterial polypeptide expression 

employing tryptophan promoter-operator. Kleid, Dennis G.; Yansura, Daniel 
G.; Heyneker, Herbert L.; Miozzari, Giuseppe F. (Genentech, Inc., USA). 
Can. CA 1213539 A2 19861104, 50 pp. Division of Can. Appl. No. 373,565. 
(English). CODEN: CAXXA4 . APPLICATION: CA 1985-482003 19850521. 
PRIORITY: US 1980-133296 19800324; CA 1981-373565 19810320. 

AB A method for cleaving double-stranded DNA at any point, even in the 

absence of a restriction recognition site, is developed and used in the 
construction of expression plasmids contg. heterologous genes under the 
control of the trp promoter-operator lacking the attenuator for efficient 
expression in Escherichia coli without tryptophan starvation. The method 
comprises (1) converting the double-stranded DNA to single-stranded DNA in 
the region surrounding the intended cleavage point by reaction with 
.lambda, exonuclease; (2) hybridizing a DNA primer to the single-stranded 



5na formed such that the 5' end of the primer is coterminus with the 
nucleotide on the single-stranded DNA just prior to the intended cleavage 
site; (3) extending the primer in the 3' direction with DNA 

***polymerase*** ; and (4) simultaneously or thereafter, digesting away 
the portion of the single-stranded DNA beyond the intended cleavage point, 
Plasmid pGMl from which the trp attenuator region within the leader 
sequence had been deleted contained the trp promoter-operator (trp p.o.) 
region operatively linked to the codons for, from 5* to 3', the 1st 6 
amino acids of the trp leader peptide (L) , the distal regions of the trpE 
protein (E'), and the entire trpD protein (D) . Construction of an 
expression vector carrying a somatostatin-trpLE ' chimeric gene under the 
control of the trp p.o. was carried out by (1) excising from pGMl the 
EcoRI-PvuII fragment carrying trp p.o., LE * , and the 5' half of D (D'), 
and inserting the fragment in the EcoRI site of plasmid pSOMII carrying 
the somatostatin gene to obtain pS0M7 . DELTA. 2 ; with Hindlll which cut at 
the 5* region of D'; (3) treating the linearized plasmid with .lambda, 
exonuclease until the single-stranded region extended beyond the 3' end of 
LE*/ (4) hybridizing a primer having its 5* nucleotide complementary to 
the 3* nucleotide of LE* to the single-stranded region, and extending it 
using Klenow fragment; (5) digesting away the single-stranded region left 
with 3' to 5' exonuclease (6) excising the trp p.o.-LE' fragment with 
Bglll, and converting the blunt 3' end of LE ' to EcoRI site; and (7) 
ligating the fragment obtained in 6 to pS0M7 . DELTA. 2 having the 
Bglll-EcoRI fragment excised, yielding plasmid pS0M7 . DELTA. 2 . DELTA. 4 with 
the entire D' deleted and with the somatostatin gene fused to LE ' under 
the control of the trp p.o. 



-> S L6 AND L4 

L8 1 L6 AND L4 

=> S L6 AND L5 
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=> S THERMUS;S THERMOSTABLE 
Lll 2607 THERMUS 



L12 9824 THERMOSTABLE 
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L13 23 LIO AND Lll 

==> S LIO AND L12 

L14 17 LIO AND L12 

=> S L13,L14 

L15 27 (L13 OR L14) 

=> D 1-27 TI 

L15 ANSWER 1 OF 27 CAPLUS COPYRIGHT 2002 ACS 

TI Detection of RNA targets using INVADER oligonucleotide-directed cleavage 
reactions and construction of modified Thermus polymerase enzymes with 
thermostable 5 '-nuclease activities 

L15 ANSWER 2 OF 27 CAPLUS COPYRIGHT 2002 ACS 

TI RNA template-dependent ★^★s^^r** i ***nuclease*** activity of 

***Thermus**-^ aquaticus and ***Thermus*** thermophilus DNA 
***polymerases*** 

L15 ANSWER 3 OF 27 CAPLUS COPYRIGHT 2002 ACS 

TI Invasive cleavage of nucleic acids for detecting and characterizing target 
nucleic acids and microbial nucleases for the methods 

LIS ANSWER 4 OF 27 CAPLUS COPYRIGHT 2002 ACS 

TI Detection of nucleic acids by invader-directed cleavage 



LIS ANSWER 5 OF 27 CAPLUS COPYRIGHT 2002 ACS 

TI Experimental and theoretical analysis of the invasive signal amplification 
reaction 

L15 ANSWER 6 OF 27 CAPLUS COPYRIGHT 2002 ACS 
TI Detection of RNA by invader-directed cleavage 

L15 ANSWER 7 OF 27 CAPLUS COPYRIGHT 2002 ACS 

TI Detection of nucleic acids by multiple sequential invasive cleavages 
L15 ANSWER 8 OF 27 CAPLUS COPYRIGHT 2002 ACS 

TI Invasive cleavage of nucleic acids with ***thermostable*** ★★★5*vr* 
i_ ***nuclease*** for mutation detection and diagnostic applications. 

L15 ANSWER 9 OF 27 CAPLUS COPYRIGHT 2002 ACS 

TI Comparison of the ★★★s**^ » ***nuclease*** activities of Taq DNA 
***polymerase*** and its isolated nuclease domain 

LIS ANSWER 10 OF 27 CAPLUS COPYRIGHT 2002 ACS 

TI Method for the determination of large parvovirus B19 concentrations in 

blood using ***polymerase*** chain reaction at suboptimal temperature 
in conjunction with an additional fluorescent reporter primer 

LIS ANSWER 11 OF 27 CAPLUS COPYRIGHT 2002 ACS 

TI Primary structure of the DNA ***polymerase*** I gene of an 

. alpha . -proteobacterium, Rhizobium leguminosarum, and comparison with 
other family A DNA ***polymerases*** 

LIS ANSWER 12 OF 27 CAPLUS COPYRIGHT 2002 ACS 

TI Mutant chimeric ***Thermus*** /Tma DNA ***polymerases*** with 
improved properties for nucleic acid sequencing 

LIS ANSWER 13 OF 27 CAPLUS COPYRIGHT 2002 ACS 

TI One-tube fluorogenic reverse transcription- ***polymerase*** chain 
reaction for the quantitation of feline coronaviruses 

LIS ANSWER 14 OF 27 CAPLUS COPYRIGHT 2002 ACS 
TI Rapid detection of mutations in the p53 gene 

LIS ANSWER IS OF 27 CAPLUS COPYRIGHT 2002 ACS 

TI Cleavage of nucleic acid acid using ***thermostable*** Methanococcus 
jannaschii FEN-1 endonucleases 

LIS ANSWER 16 OF 27 CAPLUS COPYRIGHT 2002 ACS 

TI Detection of nucleic acids and sequence variations by multiple sequential 
invasive cleavages 

LIS ANSWER 17 OF 27 CAPLUS COPYRIGHT 2002 ACS 

TI Invasive cleavage of nucleic acids for detecting and characterizing target 
nucleic acids and microbial nucleases for the methods 

LIS ANSWER 18 OF 27 CAPLUS COPYRIGHT 2002 ACS 

TI Advances in quantitative PGR technology: ^* + 5*:*r^ » ^^^nuclease*** 
assays 

LIS ANSWER 19 OF 27 CAPLUS COPYRIGHT 2002 ACS 

TI Invasive cleavage of nucleic acids for detecting and characterizing target 
nucleic acids and microbial nucleases for the methods 

LIS ANSWER 20 OF 27 CAPLUS COPYRIGHT 2002 ACS 

TI ***5*** I ***nucleases**^ derived from ***thermostable*** DNA 

***polymerases*** and their use in a nucleic acid detection method 

LIS ANSWER 21 OF 27 CAPLUS COPYRIGHT 2002 ACS 

TI DNA sequences encoding designed ***Thermus**^ DNA ***polymerase*** 
mutants that are synthesis-deficient, ***thermostable*^* , and useful 
for DNA site-specific cleavage and detection 

LIS ANSWER 22 OF 27 CAPLUS COPYRIGHT 2002 ACS 

TI Structure of Taq ***polymerase*** with DNA at the ***polymerase*^^ 
active site 



L15 ANSWER 23 OF 27 CAPLUS COPYRIGHT 2002 ACS 

TI Nucleic acid detection and identification using site-specific cleavage, 

especially for analysis of human disease-related mutant gene or microbial 
pathogen nucleic acid analysis 

L15 ANSWER 24 OF 27 CAPLUS COPYRIGHT 2002 ACS 

TI A PCR-based assay for the detection of Escherichia coli Shiga-like toxin 
genes in ground beef 

L15 ANSWER 25 OF 27 CAPLUS COPYRIGHT 2002 ACS 

TI Crystal structure of ***Thermus*** aquaticus DNA ***polymerase*** 
L15 ANSWER 2 6 OF 27 CAPLUS COPYRIGHT 2002 ACS 

TI Topographical characterization of the DNA ***polymerase*** from 

***Thermus*** aquaticus. Defining groups of inhibitor mAbs by epitope 
mapping and functional analysis using surface plasmon resonance 

L15 ANSWER 27 OF 27 CAPLUS COPYRIGHT 2002 ACS 

Tj -kirirc^-k-k-*: ._ * * *nucleas es * * * derived from ***thermostable*** DNA 

***polymerases*** and their use in a nucleic acid detection method 



-> D 1-9,12,19-21,27 CBIB ABS 

L15 ANSWER 1 OF 27 CAPLUS COPYRIGHT 2002 ACS 

2001:868661 Document No. 136:49292 Detection of RNA targets using INVADER 
oligonucleotide-directed cleavage reactions and construction of modified 
Thermus polymerase enzymes with thermostable 5 '-nuclease activities. 
Allawi, Hatim; Bartholomay, Christian Tor; Chehak, Luanne; Curtis, 
Michelle L.; Eis, Peggy S.; Hall, Jeff G. ; Ip, Hon S.; Kaiser, Michael; 
Kwiatkowski, Robert W., Jr.; Lukowiak, Andrew A.; Lyamichev, Victor; Ma, 
Wupo; Olson-munoz, Marilyn C; Olson, Sarah M.; Schaefer, James J.; 
Skrzypczynski, Zbigniew; Takova, Tsetska Y.; Vedvik, Kevin L.; Lyamichev, 
Natalie E.; Neri, Bruce P. (Third Wave Technologies, Inc., USA). PCT Int. 
Appl. WO 2001090337 A2 20011129, 1266 pp. DESIGNATED STATES: W: AE, AG, 
AL, AM, AT, AU, AZ, BA, BB, BG, BR, BY, BZ, CA, CH, CN, CO, CR, CU, CZ, 
DE, DK, DM, DZ, EC, EE, ES, FI, GB, GD, GE, GH, GM, HR, HU, ID, IL, IN, 
IS, JP, KE, KG, KP, KR, KZ, LC, LK, LR, LS, LT, LU, LV, MA, MD, MG, MK, 
MN, MW, MX, MZ, NO, NZ, PL, PT, RO, RU, SD, SE, SG, SI, SK, SL, TJ, TM, 
TR, TT, TZ, UA, UG, US, UZ, VN, YU, ZA, ZW, AM, AZ, BY, KG, KZ, MD, RU, 
TJ, TM; RW: AT, BE, BF, BJ, CF, CG, CH, CI, CM, CY, DE, DK, ES, FI, FR, 
GA, GB, GR, IE, IT, LU, MC, ML, MR, NE, NL, PT, SE, SN, TD, TG, TR. 
(English). CODEN: PIXXD2 . APPLICATION: WO 2001-US17086 20010524. 
PRIORITY: US 2000-577304 20000524; US 2001-758282 20010111; US 2001-864426 
20010524; US 2001-864636 20010524. 

AB The present invention provides novel cleavage agents and polymerases for 
the cleavage and modification of nucleic acid. The cleavage agents and 
polymerases find use, for example, for the detection and characterization 
of nucleic acid sequences and variations in nucleic acid sequences. In 
some embodiments, the 5 '-nuclease activity of a variety of modified 
Thermus polymerase enzymes is used to cleave a target-dependent cleavage 
structure, thereby indicating the presence of specific nucleic acid 
sequences or specific variations thereof. The term "cleavage structure" 
refers to a structure that is formed by the interaction of at least one 
probe oligonucleotide (called the INVADER oligonucleotide) and a target 
nucleic acid, forming a structure comprising a duplex, the resulting 
structure being cleavable by a cleavage agent including but not limited to 
an enzyme. A sample suspected of contg. the target sequence is contacted 
with oligonucleotides capable of forming an invasive cleavage structure in 
the present of the target sequence and with an agent for detecting the 
presence of the invasive cleavage structure. ARRESTOR oligonucleotides 
improve sensitivity of multiple sequential invasive cleavage assays and 
allow use of higher concns. of primary probe without increasing background 
signal. ^ The detailed description of the invention includes: (1) detection 
of specific nucleic acid sequences using 5* -nucleases in an 
INVADER-directed cleavage assay; (2) signal enhancement by incorporating 
the products of an invasive cleavage reaction into a subsequent invasive 
cleavage reaction; (3) effect of ARRESTOR oligonucleotides on signal and 
background in sequential invasive cleavage reactions; (4) improved enzymes 
for the use in INVADER oligonucleotide-directed cleavage reactions 



comprising RNA targets; (5) reaction design for INVADER assay detection of 
RNA targets; (6) kits for performing the RNA invader assay; and (7) the 
INVADER assay for direct detection and measurement of specific RNA 
analytes . 

L15 ANSWER 2 OF 27 CAPLUS COPYRIGHT 2002 ACS 

2000:579036 Document No. 133:318891 RNA template-dependent *-^-^s*** ' 

***nuclease*** activity of ***Thermus*** aquaticus and 

***Thermus*** thermophilus DNA ***polymerases*** . Ma, Wu-Po; 
Kaiser, Michael W.; Lyamicheva, Natasha; Schaefer, James J.; Allawi, Hatim 
T.; Takova, Tsetska; Neri, Bruce P.; Lyamichev, Victor I. (Third Wave 
Technologies, Inc., Madison, WI, 53719, USA). J. Biol. Chem., 275(32), 
24693-24700 (English) 2000. CODEN: JBCHA3 . ISSN: 0021-9258. Publisher: 
American Society for Biochemistry and Molecular Biology. 
AB DNA replication and repair require a specific mechanism to join the 3'- 
and 5 '-ends of two strands to maintain DNA continuity. In order to 
understand the details of this process, we studied the activity of the 

★★★5*** I ***nucleases*** with substrates contg. an RNA template 
strand. By comparing the eubacterial and archaeal ★★★5*^* » 

***nucleases*** , we show that the ***polymerase*** domain of the 
eubacterial enzymes is crit. for the activity of the ★★*5*** » 

***nuclease*** domain on RNA contg. substrates. Anal, of the activity 
of chimeric enzymes between the DNA ***polymerases*** from 

^**Thermus*** aquaticus (TaqPol) and ***Thermus*** thermophilus 
(TthPol) reveals two regions, in the thumb and in the palm subdomains, 
crit. for RNA-dependent ***5*** • ***nuclease*** activity. There 
are two crit. amino acids in those regions that are responsible for the 
high activity of TthPol on RNA contg. substrates. Mutating glycine 418 
and glutamic acid 507 of TaqPol to lysine and glutamine, resp., increases 
its RNA-dependent ★^★s*** » ***nuclease*** activity 4-10-fold. 
Furthermore, the RNA-dependent DNA ***polymerase*** activity is 
controlled by a completely different region of TaqPol and TthPol, and 
mutations in this region do not affect the ^★★s*** * ***nuclease*** 
activity. The results presented here suggest a novel substrate binding 
mode of the eubacterial DNA ***polymerase*** enzymes, called a 

***5^** » ***nuclease*** mode, that is distinct from the polymg. and 
editing modes described previously. The application of the enzymes with 
improved RNA-dependent * + *5:*r** i ***nuclease*** activity for RNA 
detection using the invasive signal amplification assay is discussed. 

L15 ANSWER 3 OF 27 CAPLUS COPYRIGHT 2002 ACS 

2000:492042 Document No. 133:116707 Invasive cleavage of nucleic acids for 
detecting and characterizing target nucleic acids and microbial nucleases 
for the methods. Kaiser, Michael W.; Lyamichev, Victor I.; Lyamicheva, 
Natasha (Third Wave Technologies, Inc., USA). U.S. US 6090606 A 
20000718, 262 pp., Cont . -in-part of U.S. Ser. No. 756,386. (English). 
CODEN: USXXAM. APPLICATION: US 1996-758314 19961202. PRIORITY: US 
1996-599491 19960124; US 1996-682853 19960712; US 1996-756386 19961129; US 
1996-756376 19961202. 

AB Disclosed are methods for the detection and characterization of nucleic 
acid sequences and their variants by using structure-specific ★+*5^^* 
»„ ★★★nucleases*** derived from ***thermostable*** DNA 

***polymerases*** , e.g., of the FEN-1, RAD2, or XPG class of nucleases. 
The enzyme cleaves the target nucleic acid sequence at a structure formed 
via annealing with 2 pilot oligonucleotide sequences. Also disclosed are 
methods and devices for the sepn. of nucleic acid mols. based on charge. 
Also disclosed are methods for the detection of non-target cleavage 
products via the formation of a complete and activated protein binding 
region. Isolation of genes for endonuclease FEN-1 from Pyrococcus woesei 
and other microorganisms were described. Prepn. of ★★★s*** »_ 

***nucleases*** by deleting the C-terminal ***polymerase*** domain 
or by point mutations of Taq DNA ***polymerase*** was shown. The 
cleavage method was used for the identification of hepatitis C virus and 
human ras gene. 

L15 ANSWER 4 OF 27 CAPLUS COPYRIGHT 2002 ACS 

2000:492035 Document No. 133:115874 Detection of nucleic acids by 

invader-directed cleavage. Prudent, James R.; Hall, Jeff G.; Lyamichev, 
Victor I.; Brow, Mary Ann D.; Dahlberg, James E. (Third Wave Technologies, 
Inc., USA). U.S. US 6090543 A 20000718, 263 pp., Cont . -in-part of U.S. 
Ser. No. 756,386. (English). CODEN: USXXAM. APPLICATION: US 1996-759038 



19961202. PRIORITY: US 1996-599491 19960124; US 1996-682853 19960712; US 
1996-756386 19961129; US 1996-758314 19961202. 
AB The present invention relates to means for the detection and 

characterization of nucleic acid sequences, as well as variations in 
nucleic acid sequences, by an oligonucleotide-directed cleavage detection 
assay. The present invention also relates to methods for forming a 
nucleic acid cleavage structure on a target sequence and cleaving the 
nucleic acid cleavage structure in a site-specific manner. The 
structure-specific nuclease activity of a variety of enzymes is used to 
cleave the target-dependent cleavage structure, thereby indicating the 
presence of specific nucleic acid sequences or specific variations 
thereof. Derivs . of ***thermostable"^** DNA ***polymerases*** and 
their mutants that retain their ★★★s*** »_ ***nuclease*** activity 
but lack ***polymerase*** activity are described for use in the 
nucleic acid detection system. The nuclease activity cleaves the 
single-stranded moiety of a Y-shaped structure and so is of use in 
selected cleavage of reporter sequences in a hybridization assay that 
includes ★★★s*** f_ ***nuclease*** -dependent cleavage and 
amplification steps. The present invention further relates to methods and 
devices for the sepn. of nucleic acid mols. based on charge. The cleavage 
method was used for the identification of hepatitis C virus. 

L15 ANSWER 5 OF 27 CAPLUS COPYRIGHT 2002 ACS 

2000:457899 Document No. 133:330098 Experimental and theoretical analysis of 
the invasive signal amplification reaction. Lyamichev, Victor I.; Kaiser, 
Michael W.; Lyamicheva, Natasha E.; Vologodskii, Alexander V.; Hall, Jeff 
G.; Ma, Wu-Po; Allawi, Hatim T.; Neri, Bruce P. (Third Wave Technologies 
Inc., Madison, WI, 53719-1256, USA). Biochemistry, 39(31), 9523-9532 
(English) 2000. CODEN: BICHAW. ISSN: 0006-2960. Publisher: American 
Chemical Society. 

AB The invasive signal amplification reaction is a sensitive method for 

single nucleotide polymorphism detection and quant, detn. of viral load 
and gene expression. The method requires the adjacent binding of upstream 
and downstream oligonucleotides to a target nucleic acid (either DNA or 
RNA) to form a specific substrate for the structure-specific ^★★s*^* » 

***nucleases*** that cleave the downstream oligonucleotide to generate 
signal. By running the reaction at an elevated temp., the downstream 
oligonucleotide cycles on and off the target leading to multiple cleavage 
events per target mol . without temp, cycling. We have examd. the 
performance of the FENl enzymes from Archaeoglobus fulgidus and 
Methanococcus jannaschii and the DNA ^^^polymerase*** I homologues 
from ***Thermus*** aquaticus and ***Thermus*** thermophilus in the 
invasive signal amplification reaction. We find that the reaction has a 
distinct temp, optimum which increases with increasing length of the 
downstream oligonucleotide. Raising the concn. of either the downstream 
oligonucleotide or the enzyme increases the reaction rate. When the 
reaction is configured to cycle the upstream instead of the downstream 
oligonucleotide, only the FENl enzymes can support a high level of 
cleavage. To investigate the origin of the background signal generated 
during the invasive reaction, the cleavage rates for several nonspecific 
substrates that arise during the course of a reaction were measured and 
compared with the rate of the specific reaction. We find that the 
different +**5*** • ***nuclease*** enzymes display a much greater 
variability in cleavage rates on the nonspecific substrates than on the 
specific substrate. The exptl. data are compared with a theor. model of 
the invasive signal amplification reaction. 

L15 ANSWER 6 OF 27 CAPLUS COPYRIGHT 2002 ACS 

1999:794247 Document No. 132:31733 Detection of RNA by invader-directed 

cleavage. Brow, Mary Ann D.; Hall, Jeff Steven Grotelueschen; Lyamichev, 
Victor; Olive, David Michael; Prudent, James Robert (Third Wave 
Technologies, Inc., USA). U.S. US 6001567 A 19991214, 167 pp., 
Cont .-in-part of U.S. 5,846,717. (English). CODEN: USXXAM. APPLICATION: 
US 1996-682853 19960712. PRIORITY: US 1996-599491 19960124. 

AB The present invention relates to means for the detection and 

characterization of nucleic acid sequences, as well as variations in 
nucleic acid sequences, by an oligonucleotide-directed cleavage detection 
assay. The present invention also relates to methods for forming a 
nucleic acid cleavage structure on a target sequence and cleaving the 
nucleic acid cleavage structure in a site-specific manner. The 
structure-specific nuclease activity of a variety of enzymes is used to 



cleave the target-dependent cleavage structure, thereby indicating the 
presence of specific nucleic acid sequences or specific variations 
thereof. Derivs . of ***thermostable*** DNA ***polyitierases*** and 
their mutants that retain their ***5*** »- ***nuclease*** activity 
but lack ***polymerase*** activity are described for use in the 
nucleic acid detection system. The nuclease activity cleaves the 
single-stranded moiety of a Y-shaped structure and so is of use in 
selected cleavage of reporter sequences in a hybridization assay that 
includes ★★★s*** »_ ***nuclease*** -dependent cleavage and 
amplification steps. The present invention further relates to methods and 
devices for the sepn. of nucleic acid mols. based on charge. The cleavage 
method was used for the identification of hepatitis C virus. 

L15 ANSWER 7 OF 27 CAPLUS COPYRIGHT 2002 ACS 

1999:761460 Document No. 132:9599 Detection of nucleic acids by multiple 

sequential invasive cleavages. Hall, Jeff G.; Lyamichev, Victor I.; Mast, 
Andrea L.; Brow, Mary Ann D, (Third Wave Technologies, Inc., USA). U.S. 
US 5994069 A 19991130, 306 pp., Cont . -in-part of U.S. Ser. No. 759,038. 
(English). CODEN: USXXAM. APPLICATION: US 1997-823516 19970324. 
PRIORITY: US 1996-599491 19960124; US 1996-682853 19960712; US 1996-756386 
19961129; US 1996-759038 19961202; US 1996-758314 19961202; WO 1997-US1072 
19970122. 

AB The present invention relates to means for the detection and 

characterization of nucleic acid sequences, as well as variations in 
nucleic acid sequences, by an Invader. RTM. oligonucleotide-directed 
cleavage detection assay. The present invention also relates to methods 
for forming a nucleic acid cleavage structure on a target sequence and 
cleaving the nucleic acid cleavage structure in a site-specific manner. 
The structure-specific nuclease activity of a variety of enzymes is used 
to cleave the target-dependent cleavage structure, thereby indicating the 
presence of specific nucleic acid sequences or specific variations 
thereof. Derivs. of ***thermostable*** DNA ***polymerases*** and 
their mutants that retain their ★:*r*5**+ i_ ***nuclease*** activity 
but lack ***polymerase*** activity are described for use in the 
nucleic acid detection system. The nuclease activity cleaves the 
single-stranded moiety of a Y-shaped structure and so is of use in 
selected cleavage of reporter sequences in a hybridization assay that 
includes *+*5*** i_ ^^^nuclease*** -dependent cleavage and 
amplification steps. The present invention further relates to methods and 
devices for the sepn, of nucleic acid mols. based on charge. The present 
invention also provides methods for the detection of non-target cleavage 
products via the formation of a complete and activated protein binding 
region. The invention further provides sensitive and specific methods for 
the detection of human cytomegalovirus nucleic acid in a sample. 

L15 ANSWER 8 OF 27 CAPLUS COPYRIGHT 2002 ACS 

1999:732986 Document No, 131:347456 Invasive cleavage of nucleic acids with 
***thermostable*** ★★★S*** »_ ***nuclease*** for mutation 

detection and diagnostic applications.. Prudent, James R.; Hall, Jeff G,; 
Lyamichev, Victor I.; Brow, Mary Ann D.; Dahlberg, James E. (Third Wave 
Technologies, Inc., USA). U.S. US 5985557 A 19991116, 182 pp., 
Cont .-in-part of U.S. Ser. No. 682,853. (English). CODEN: USXXAM. 
APPLICATION: US 1996-756386 19961129. PRIORITY: US 1996-599491 19960124; 
US 1996-682853 19960712. 

AB The present invention relates to means for the detection and 

characterization of nucleic acid sequences, as well as variations in 
nucleic acid sequences. The present invention also relates to methods for 
forming a nucleic acid cleavage structure on a target sequence and 
cleaving the nucleic acid cleavage structure in a site-specific manner. 
The structure-specific ★ + ★5*:*.* »_ ***nuclease*^* activity of a 
variety of enzymes is used to cleave the target-dependent cleavage 
structure, thereby indicating the presence of specific nucleic acid 
sequences or specific variations thereof. These ★★★s*** »_ 

***nucleases*** are capable of cleaving linear duplex structures to 
create single discrete cleavage products identified using fluorescence 
imaging. The reaction involves a trigger and a detection reaction where a 
hairpin conformation is recognized. Here the target nucleic acid is not 
completely complementary to at least one of the first, second, third and 
fourth oligonucleotides. Assays where the target nucleic acid is reused 
or recycled during multiple rounds of hybridization with oligonucleotide 
probes and cleavage without the need to use temp, cyclin or nucleic acid 



synthesis. Through the interaction of the cleavage means an upstream 
oligonucleotide can be made to cleave a downstream oligonucleotide at an 
internal site in such a way that the resulting fragments of the downstream 
oligonucleotide dissocd. from the target nucleic acid, thereby making that 
region of the target nucleic acid available for hybridization to another, 
uncleaved copy of the downstream oligonucleotide. The specific stability 
designed into the invader and probe sequences will depend on the temp, at 
which one desires to perform the reaction. It is desirable that the 
invader oligonucleotide be immediately available to direct the cleavage of 
each probe oligonucleotide that hybridizes to a target nucleic acid. For 
this reason, the invader oligonucleotide is provided in excess over the 
probe oligonucleotide. The non-target cleavage products are incubated 
with a template-independent ***polymerase*** and one nucleoside 
triphosphate under conditions such that at least one nucleotide is added 
to the 3'-hydroxyl group of the non-target cleavage products to generate 
tailed products. The present invention also provides novel methods and 
devices for the sepn. of nucleic acid mols. by charge by charge reversal. 
When an oligonucleotide is shortened through the action of a CLEAVASE 
enzyme or other cleavage agent, the pos . charge can be made to not only 
significantly reduce the net neg. charge, but to actually override it, 
effectively "flipping" the net charge of the labeled entity. The reversal 
of charge allows the products of target-specific cleavage to be 
partitioned from uncleaved probe by extremely simple means. It has clin. 
diagnostic applications as multiple alleles could be screened at once. 

L15 ANSWER 9 OF 27 CAPLUS COPYRIGHT 2002 ACS 

1999:456712 Document No. 131:225339 Comparison of the -k-ki.c^-k-k^ i 

***nuclease*** activities of Taq DNA ***polymerase*** and its 
isolated nuclease domain. Lyamichev, Victor; Brow, Mary Ann D.; Varvel, 
Virgil E.; Dahlberg, James E. (Department of Biomolecular Chemistry, 
University of Wisconsin, Madison, WI, 53706, USA) . Proceedings of the 
National Academy of Sciences of the United States of America, 96(11), 
6143-6148 (English) 1999. CODEN: PNASA6. ISSN: 0027-8424. Publisher: 
National Academy of Sciences. 
AB Many eubacterial DNA ***polymerases*** are bifunctional mols. having 

both polymn. (P) and ★★★s*** ' ***nuclease*** (N) activities, which 
are contained in separable domains. We previously showed that the DNA 

***polymerase*** I of ***Thermus*** aquaticus (TaqNP) 
endonucleolytically cleaves DNA substrates, releasing unpaired 5* arms of 
bifurcated duplexes. Here, we compare the substrate specificities of 
TaqNP and the isolated ★★★s + ^t* » ***nuclease*** domain of this 
enzyme, TaqN. Both enzymes are significantly activated by primer 
oligonucleotides that are hybridized to the 3' arm of the bifurcation; 
optimal stimulation requires overlap of the 3' terminal nucleotide of the 
primer with the terminal base pair of the duplex, but the terminal 
nucleotide need not hybridize to the complementary strand in the 
substrate. In the presence of Mn2+ ions, TaqN can cleave both RNA and 
circular DNA at structural bifurcations. Certain anti-TaqNP mAbs block 
cleavage by one or both enzymes, whereas others can stimulate cleavage of 
non-optimal substrates . 
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1999:64619 Document No. 130:121430 Mutant chimeric ***Thermus*** /Tma DNA 
^^'^^polymerases*** with improved properties for nucleic acid sequencing. 
Gelfand, David Harrow; Reichert, Fred Lawrence (F. Hoffmann-La Roche Ag, 
Switz.). Eur. Pat. Appl . EP 892058 A2 19990120, 47 pp. DESIGNATED 
STATES: R: AT, BE, CH, DE, DK, ES, FR, GB, GR, IT, LI, LU, NL, SE, MC, 
PT, IE, SI, LT, LV, FI, RO . (English). CODEN: EPXXDW. APPLICATION: EP 
1998-112327 19980703. PRIORITY: US 1997-52065 19970709. 
AB The invention provides mutant, chimeric ***thermostable*** DNA 

***polymerase*** enzymes consisting of an N-terminal region derived from 
the ★★★s*** »- ***nuclease*** domain of a ***Thermus*** species 
DNA ***polymerase*** and a C-terminal region derived from the 3* to 5' 
exonuclease and ***polymerase*** domains of Tma DNA ***polymerase*** 

These mutant chimeric ***thermostable*** DNA ***polymerase*** 
enzymes have improved properties in nucleic acid sequencing reactions. 
Also provided are nucleic acids encoding said mutant chimeric 

***thermostable*** DNA ***polymerase*** enzymes, vectors comprising 
said nucleic acids and host cells transformed with said vectors. Also 
provided are compns. comprising said mutated, chimeric 

***thermostable*** DNA ***polymerase*** enzymes and non-ionic 



polymeric detergent (s) . Furthermore methods for producing the said 
enzymes and methods and kits for using the said enzymes are provided. 
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1997:513643 Document No. 127:202064 Invasive cleavage of nucleic acids for 
detecting and characterizing target nucleic acids and microbial nucleases 
for the methods. Hall, Jeff G.; Lyamichev, Victor I.; Prudent, James R, ; 
Brow, Mary Ann D. ; Kaiser, Michael W.; Lyamichev, Natasha; Olive, David 
Michael; Dahlberg, James E.; et al. (Third Wave Technologies, Inc., USA; 
Hall, Jeff G.; Lyamichev, Victor I.; Prudent, James R.; Brow, Mary Ann D.; 
Kaiser, Michael W.; Lyamichev, Natasha; Olive, David Michael; Dahlberg, 
James E.). PCT Int. Appi . WO 9727214 Al 19970731, 456 pp. DESIGNATED 
STATES: W: AU, CA, JP, US; RW: AT, BE, CH, DE, DK, ES, FI, FR, GB, GR, 
IE, IT, LU, MC, NL, PT, SE. (English). CODEN: PIXXD2 . APPLICATION: WO 
1997-US1072 19970122. PRIORITY: US 1996-599491 19960124; US 1996-682853 
19960712; US 1996-756386 19961129; US 1996-758314 19961202; US 1996-759038 
19961202. 

AB Disclosed are methods for the detection and characterization of nucleic 
acid sequences and their variants by using structure-specific **+5*** 
f_ *^*nucleases*** derived from ***thermostable*** DNA 

***polymerases*** , e.g., of the FEN-1, RAD2, or XPG class of nucleases. 
The enzyme cleaves the target nucleic acid sequence at a structure formed 
via annealing with 2 pilot oligonucleotide sequences. Also disclosed are 
methods and devices for the sepn. of nucleic acid mols. based on charge. 
Also disclosed are methods for the detection of non-target cleavage 
products via the formation of a complete and activated protein binding 
region. Isolation of genes for endonuclease FEN-1 from Pyrococcus woesei 
and other microorganisms were described. Prepn. of ★★★5*** »_ 

***nucleases*** by deleting the C-terminal ***polymerase*** domain 
or by point mutations of Taq DNA * **polymerase*** was shown. The 
cleavage method was used for the identification of hepatitis C virus and 
human ras gene. 
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1997:260104 Document No. 126:260880 ★★★s*** » ***nucleases*** derived 
from ^**thermostable^** DNA ***polymerases*** and their use in a 
nucleic acid detection method. Dahlberg, James E.; Lyamichev, Victor I.; 
Brow, Mary Ann D. (Third Wave Technologies, Inc., USA). U.S. US 5614402 A 
19970325, 93 pp. Cont , -in-part of U.S. 5,541,311. (English). CODEN: 
USXXAM. APPLICATION: US 1994-254359 19940606. PRIORITY: US 1992-986330 
19921207; US 1993-73384 19930604. 
AB Derivs. of ***thermostable**-^ DNA "^^^polymerases*** that retain 

their ★★+5*** »_ +**nuclease*** activity but lack ***polymerase*** 
are described for use in a nucleic acid detection system. The nuclease 
activity cleaves the single-stranded moiety of a Y-shaped structure and so 
is of use in selected cleavage of reporter sequences in a hybridization 
assay that includes two +**5*** ^^^nuclease"*^** -dependent cleavage 

and amplification steps. The presence of the target sequence is 
demonstrated by the release of the reporter moiety from sequences 
immobilized on a carrier. The ability of the nuclease activity to cleave 
such structures was shown by the inability of intact Taq 

***polymerase*** to amplify a hairpin sequence, although the 
nuclease-f ree Stoffel fragment could amplify the target sequence. The 
prepn. and characterization of a no, of ***polymerase*** mutants for 
use in these assays is demonstrated. Specific alterations of the 

***Thermus*** aquaticus Taq gene wee: a deletion between nucleotides 
1601 and 2502 (the end of the coding region), a 4-nucleotide insertion at 
position 2043, and deletions between nucleotides 1614 and 1848 and between 
nucleotides 875 and 1778. Three of these derived ***5*** <- 

***nucleases*** were designated Cleavase BX, Cleavase BB, and Cleavase 
BN. 
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1996:524264 Document No. 125:187560 DNA sequences encoding designed 
***Thermus*** DNA ***polymerase*** mutants that are 
synthesis-deficient, ***thermostable*** , and useful for DNA 
site-specific cleavage and detection. Dahlberg, James E.; Lyamichev, 
Victor I.; Brow, Mary Ann D. (Third Wave Technologies, Inc., USA). U.S. 
US 5541311 A 19960730, 76 pp. Cont . -in-part of U.S. Ser. No. 986,330, 
abandoned. (English) . CODEN: USXXAM. APPLICATION: US 1993-73384 
19930604. PRIORITY: US 1992-986330 19921207. 



'AB A' means for cleaving a nucleic acid cleavage structure in a site-specific 
manner is disclosed. A cleaving enzyme having ***5*** » 

***nuclease*** activity without interfering nucleic acid synthetic 
ability is employed as the basis of a novel method of detection of 
specific nucleic acid sequences. Cleaving enzymes are produced through 
the use of novel DNA sequences which encode novel ^thermostable*** 

***polymerases*** 
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1995:464450 Document No. 122:259841 ★★★5*** i, ***nucleases*** derived 
from ***thermostable*** DNA ***polymerases*** and their use in a 
nucleic acid detection method. Dahlberg, James E.; Lyamichev, Victor I.; 
Brow, Mary Ann D. (Third Wave Technologies, Inc., USA). PCT Int. Appl . WO 
9429482 Al 19941222, 158 pp. DESIGNATED STATES: W: AU, CA, JP; RW: AT, 
BE, CH, DE, DK, ES, FR, GB, GR, IE, IT, LU, MC, NL, PT, SE. (English). 
CODEN: PIXXD2. APPLICATION: WO 1994-US6253 19940606. PRIORITY: US 
1993-73384 19930604. 

AB Derivs. of ***thermostable*** DNA ***polymerases*** that retain 

their ***5*** ***nuclease*** activity but lack ***polymerase*** 

are described for use in a nucleic acid detection system. The nuclease 
activity cleaves the single-stranded moiety of a Y-shaped structure and so 
is of use in selected cleavage of reporter sequences in a hybridization 
assay that includes two ★★*5*^* i_ ***nuclease*** -dependent cleavage 
and amplification steps. The presence of the target sequence is 
demonstrated by the release of the reporter moiety from sequences 
immobilized on a carrier. The ability of the nuclease activity to cleave 
such structures was shown by the inability of intact Taq 

***polymerase*** to amplify a hairpin sequence, although the 
nuclease-free Stoffel fragment could amplify the target sequence. The 
prepn. and characterization of a no. of ***polymerase*** mutants for 
use is in these assays is demonstrated. 
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5 "NERI BRUCE" /AU 
28 "NERI BRUCE P"/AU 

1 "NERI BRUCE PHILIP"/AU 
L22 80 ("NERI B"/AU OR "NERI BRUCE"/AU OR "NERI BRUCE P"/AU OR "NERI 

BRUCE PHILIP"/AU) 
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=> S L23 AND L6 

L24 21 L23 AND L6 

-> S L24 NOT (L15 OR L7) 

L25 4 L24 NOT (L15 OR L7) 

=> D 1-4 CBIB ABS 
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2001:935825 Document No. 136:65169 Methods for identification of 

"accessible" hybridization sites in nucleic acids and diagnostic uses 
thereof. ***Lyamichev, Victor*** ; ***Allawi, Hatim*** ; Dong, 
Fang; ***Neri, Bruce p.*** ; Vener, I. Tatiana (Third Wave 
Technologies, Inc., USA). PCT Int. Appl . WO 2001098537 A2 20011227, 409 
pp. DESIGNATED STATES: W: AE, AG, AL, AM, AT, AU, AZ, BA, BB, BG, BR, 
BY, BZ, CA, CH, CN, CO, CR, CU, CZ, DE, DK, DM, DZ, EC, EE, ES, FI, GB, 
GD, GE, GH, GM, HR, HU, ID, IL, IN, IS, JP, KE, KG, KP, KR, KZ, LC, LK, 
LR, LS, LT, LU, LV, MA, MD, MG, MK, MN, MW, MX, MZ, NO, NZ, PL, PT, RO, 
RU, SD, SE, SG, SI, SK, SL, TJ, TM, TR, TT, TZ, UA, UG, US, UZ, VN, YU, 
ZA, ZW, AM, AZ, BY, KG, KZ, MD, RU, TJ, TM; RW: AT, BE, BF, BJ, CF, CG, 
CH, CI, CM, CY, DE, DK, ES, FI, FR, GA, GB, GR, IE, IT, LU, MC, ML, MR, 
NE, NL, PT, SE, SN, TD, TG, TR. (English). CODEN: PIXXD2 . APPLICATION: 
WO 2001-US19401 20010615. PRIORITY: US 2000-PV212308 20000617. 

AB The present invention relates to methods and compns . for analyzing nucleic 
acids, and in particular, methods and compns. for detection and 
characterization of nucleic acid sequences and sequence changes. The 
present invention also provides methods and compns. for identifying 
oligonucleotides with desired hybridization properties to nucleic acid 
targets contg. secondary structure. The invention also claims these 
methods for detection of HIV target sequences. Further, the invention 
claims an invasive cleavage assay for detection of HIV target sequences. 
The methods involve primers which are complementary to "accessible" and 
"inaccessible" target nucleic acid sites and a secondary primer/probe 
which is complementary to only one region 5' to the first region. This 
secondary primer/probe is complementary to a 5 ' region that at least 
partially overlaps the first region. Primers, called "extension" primers, 
which are complementary to an "accessible" target nucleic acid region can 
be extended in a template-dependent reaction by a ***polymerase*** or 
reverse transcriptase. Primers which are complementary to an 
"inaccessible" target nucleic acid region are not extended. The method 
further involves amplification of the extension products using first and 
second amplification primers. Examples of the invention include CFLP 
(cleavage fragment length polymorphism) anal, of a mutation in the 
Mycobacterium tuberculosis gene katG assocd, with isoniazid resistance, 
secondary structure anal, of M. tuberculosis gene rpoB fragments, anal, of 



Hepatitis C virus (HCV) subtypes la, lb, 2b, 2c, and 3a, and detection of 
HIV-1 sequences. For anal, of gene katG, 5 ' -tetrachlorof luorescein- 
labeled PGR fragments were created from wild-type, mutant (codon 315 G 
.fwdarw. C) , and non-wild-type sequences complementary to the mutant. 
Depending on the sequence, the PGR fragments can form a stem loop 
structure when denatured by heating and allowed to fold and the structures 
are cleaved at one site by the Cleavase I ***nuclease*** . GFLP 
products were analyzed by denaturing polyacrylamide gel electrophoresis or 
binding to a biotinylated capture probe and immobilization in 
streptavidin-coated wells in a microtiter plate. Using HGV and M. 
tuberculosis gene rpoB, bridging oligonucleotides, with sequences 
complementary to each side of a hairpin formed in a target nucleic acid 
fragment, were shown to distinguish different folded structures. Primer 
extension of bridging oligonucleotides and ligation of a bridging 
oligonucleotide with an adjacent oligonucleotide were also useful in 
discrimination of folded target structures. An invasive cleavage probe, 
also called an invasive bridging oligonucleotide, was used with a bridging 
oligonucleotide to create a target-dependent cleavage structure for an 
INVADER reaction assay with Afu FENl ***nuclease*** or Cleavase I. 
INVADER assays were applied to known accessible sites in human interferon 
.gamma. mRNA and to detect HIV-1 sequences. 
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1998:749424 Document No. 130:21326 Nucleic acid and conformation analysis by 
nucleic acid hybridization with pathogen detection. Dong, Fang; 

***Lyamichev, Victor I.*** ; Prudent, James R. ; Fors, Lance; ***Neri,* 
Bruce p.*** ; Brow, Mary Ann D.; Anderson, Todd A.; Dahlberg, James E 
(Third Wave Technologies, Inc., USA). PGT Int. Appl . WO 9850403 Al 
19981112, 279 pp. DESIGNATED STATES: W: AU, CA, JP, US; RW: AT, BE, CH, 
CY, DE, DK, ES, FI, FR, GB, GR, IE, IT, LU, MC, NL, PT, SE. (English) . 
CODEN: PIXXD2. APPLICATION: WO 1998-US3194 19980505. PRIORITY: US 
1997-851588 19970505; US 1997-934097 19970919; US 1998-34205 19980303. 

AB The present invention relates to methods and compns. for treating nucleic 
acids, and in particular, methods and compns. for the detection and 
characterization of nucleic acid sequences and sequence changes. The 
invention provides methods for examg. the conformations assumed by single 
strands of nucleic acid, forming the basis of novel methods of detection 
of specific nucleic acid sequences. The present invention contemplates 
use of novel detection methods for, among other uses, clin. diagnostic 
purposes, including but not limited to the detection and identification of 
pathogenic organisms. Examples are presented for the anal, of 
Mycobacterium tuberculosis and hepatitis C virus genes. 
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1994:475320 Document No. 121:75320 Site-directed cleavage of nucleic acids 

using pilot oligonucleotides. Dahlberg, James E.; ***Lyamichev, Victor** 
★ l^-k-k-k . Brow, Mary Ann D. (Wisconsin Alumni Research Foundation, USA) 

Eur. Pat. Appl. EP 601834 Al 19940615, 22 pp. DESIGNATED STATES: R: AT, 
BE, CH, DE, DK, ES, FR, GB, GR, IE, IT, LI, LU, MC, NL, PT, SE. 
(English). CODEN: EPXXDW. APPLICATION: EP 1993-309827 19931207. 
PRIORITY: US 1992-986330 19921207. 
AB A method of cleaving a target nucleic acid mol . by use of an 

oligonucleotide with two domains is described. One of these domains is 
complementary to a sequence 5* or 3* to the cleavage site and the other 
domain is not complementary to the target DNA. Upon hybridization a 
Y-shaped complex is formed exposing the junction site for cleavage, e.g. 
with a ***nuclease*** . Suitable enzymes for cleaving at the junction 
include the thermostable ***nuclease*** activities of DNA 
***polymerase*** such as Taq, Tfl, Tth, and non-thermostable 
***polymerases*** such as the Escherichia coli enzyme and the gene 6 
protein of bacteriophage T7 . The presence of a 5 * -exonuclease activity in 
Taq ***polymerase*** is demonstrated and the enzyme is used to cleave 
a PGR amplification product. 
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1993:576439 Document No. 119:176439 Structure-specific endonucleolytic 
cleavage of nucleic acids by eubacterial DNA ***polymerases*** 

***Lyamichev, Victor*** ; Brow, Mary Ann D.; Dahlberg, James E. (Sch. 
Med., Univ. Wisconsin, Madison, WI, 53706, USA). Science (Washington, D. 
C, 1883-), 260(5109), 778-83 (English) 1993. CODEN: SCIEAS. ISSN: 
0036-8075. 



Previously known 5* exonucleases of several eubacterial DNA 

***polyinerases*** have now been shown to be structure-specific 
endonucleases that cleave single-stranded DNA or RNA at the bifurcated end 
of a base-paired complex. Cleavage was not coupled to synthesis, although 
primers accelerated the rate of cleavage considerably. The enzyme 
appeared to gain access to the cleavage site by moving from the free end 
of a 5' extension to the bifurcation of the duplex, where cleavage took 
place. Essentially any linear single-stranded nucleic acid can be 
targeted for specific cleavage by the 5* ***nuclease*** of DNA 

***polymerase*** through hybridization with an oligonucleotide that 
converts the desired cleavage site into a substrate. 
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(54) Mutant chimeric DNA polymerases 

(57) The invention provides mutant, chimeric ther- 
mostable DNA polymerase enzymes, which chimeric 
thermostable DNA polymerase enzymes consist of an 
N-terminal region derived from the S'-nuclease domain 
of a Thermus species DNA polymerase and a C-termi- 
nal region derived from the 3' to 5' exonuclease and- 
polymerase domains of Tma DNA polymerase. These 
mutant chimeric thermostable DNA polymerase 
enzymes have improved properties in nucleic acid 
sequencing reactions. Also provided are nucleic acids 
encoding said mutant chimeric thermostable DNA 
polymerase enzymes, vectors conprising said nucleic 
acids and host cells transformed with said vectors. Also 
provided are compositions conprising said mutated, 
chimeric thermostable DNA polymerase enzymes and 
non-ionic polymeric detergent(s), Futhermore methods 
for producing the said enzymes and methods and kits 
for using the said enzymes are provided. 
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Description 

Field of the Invention 

The present invention relates to a mutant chimeric thernrastaWe DNA polymerases, methods for their synthesis, 
and methods for their use. The enzymes are useful In many recombinant DNA techniques, especially in nucleic acid 
sequencing and in nucleic acid amplification by the polymerase chain reaction (PGR). 

Backoround Art 

Thermostable DNA polymerases, which catalyze the template-directed polymerization of deoxyribonucleoside tri- 
phosphates (dNTPs) to form DNA, are used in a variety of in vitro DNA synthesis applications, such as DNA sequencing 
and DNA amplification. Typically, naturally occurring DNA polymerases strongly discriminate against the incorporation 
of nucleotide analogues. TTils property contributes to the fidelity of DNA replication and repair. However, the incorpora- 
tion of nucleotide analogues is useful for many DNA synthesis applications, in particular, in DNA sequencing. 

DNA sequencing reactions using the chain termination method initially described by Sanger etaL, 1 977, Proc. Natl. 
Acad. Sci. 74:5463-5467, incorporated herein by reference, rely on an unconventional substrate, dideoxynucleoside tri- 
phosphate (ddNTP), for termination of synthesis. In the chain termination method, both the DNA polymerase's conven- 
tional substrate (dNTP) and a chain-terminating, unconventional substrate (ddNTP or labeled ddNTP) are present in 
the reaction. Synthesis proceeds until a ddNTP is incorporated. To insure that the chain-terminating ddNTPs are incor- 
porated at a suitable rate, the inherent discrimination of the previously utilized DNA polymerases against the incorpo- 
ration of ddNTPs was overcome by providing an excess of ddNTP. 

Dye-terminator sequencing, a variant of the chain termination method, uses ddNTPs labeled with fluorescent dyes, 
such as fluorescein or rhodamine, to terminate synthesis and. simultaneously, to label the synthesized DNA. The pres- 
ence of a dye label on the ddNTP can exacerbate the discrimination by the DNA polymerase against the incorporation 
of the unconventional substrate. 

Typically, sequencing by the chain termination method is carried out using repeated steps of primer extension fol- 
lowed by heat denaturation of the primer extension product-template duplex. This embodiment, refened to as cycle 
sequencing, is carried out in a thermal cycler using a thermostable DNA polymerase. Kits for carrying out cycle 
sequencing are commercially available from, for example, Perkin Elmer, Nonwalk, CT 

Thermostable DNA polymerases derived from a variety of organisms have been described extensively in the liter- 
ature and are well known to one of skill in the art. Particular examples include DNA polymerases from a variety of spe- 
cies of the genus Thermus (see U.S. Patent No. 5,466,591), in particular from Thermus aquaticus {Tag DNA 
polymerase) described in U.S. Patent Nos. 4,889,818; 5,352.600; and 5,079,352; and the DNA polymerase from Ther- 
matoga maritima {Tma DNA polymerase) described in U.S. Patent Nos. 5,374.553 and 5,420,029; all of which are 
incorporated herein by reference. 

DNA polymerases typically possess one or more associated exonucleoiytic activities. For example Tma DNA 
polymerase catalyzes the exonucleoiytic removal of nucleotides from tiie 5'-end of a double-stranded DNA (referred to 
as 5' to 3' exonuclease activity or 5'-nuclease activity) as well as from the 3'-end of a single- or double-stranded DNA 
(referred to as 3' to 5* exonuclease activity). In contrast, DNA polymerases from the genus Thermus possess only 5'- 
nuclease activity. A review of thermostable DNA polymerases and their associated activities is found in Abramson, 
1995. in PGR Strategies. (Innis etal. ed., Academic Press. Inc.). For use in DNA sequencing, a DNA polymerase that 
lacks associated exonucleoiytic activity, eitiier 5'-nuclease activity or 3* to 5' exonuclease activity, is prefened. Mutant 
forms of a number of tiiermostable DNA polymerases which lack 5'-nuclease activity are described in U.S. Patent No. 
5,466,591, incorporated herein by reference. 

European Patent Application 0 655 506, incorporated herein by reference, describes a mutated DNA polymerase 
with an enhanced ability to incorporate dideoxynucleotides (see also U.S. Patent No. 5,614,365, incorporated herein by 
reference). The mutation is a point mutation corresponding to amino acid 526 of T7 DNA polymerase. Examples of such 
mutations include mutations in amino acid 667 of Taq DNA polymerase. 

AmpliTaq® DNA polymerase FS, a mutant form of Taq DNA polymerase tiiat has essentially no 5'-nucIease activity 
and incorporates an F667Y mutation, is sold as a component of DNA cycle sequencing kits by Perkin Elmer, Nonwalk, 
CT The F667Y mutation results in a significant reduction in the discrimination against ddNTPs. This property greatiy 
improves the sequencing data obtained from a dye-terminator sequencing reaction and reduces the amount of ddNTPs 
required for each sequencing reaction. However, the use of AmpliTaq® DNA polymerase. FS has not eliminated prob- 
lems with non-uniformity of peak heights in the sequencing trace when used with the standard rhodamine dye family- 
labeled ddNTPs. An analysis of the peak height patterns obtained using AmpliTaq® DNA polymerase, FS in dye-termi- 
nator cycle sequencing reactions is described in Parker et aL, 1 996. BioTechniques 21 (4):694-699, incorporated herein 
by reference. 
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Conventional techniques of molecular biology and nucleic acid chemistry, which are within the skill of the art, are 
explained fully in the literature. See, for example, Sambrook etaL. 1989. Molecular Cloning - A Laboratory Manual. 
Cold Spring Harbor (.aboratory, Cold Spring Harbor, New York; OligonucleotidQ Synthesis (M.J. Gait, ed., 1984); 
Nucleic Acid Hybridization (B. D. Hames and S.J. Higgins. eds.. 1 984); and a series. Methods jn Enzvmoloav (Academic 
Press, Inc.), all of which are incorporated herein by reference. All patents, patent applications, and publications cited 
herein, both supra and infra, are incorporated herein by reference. 

Summary of the Invention 

The present invention relates to , mutant, chimeric thermostable DNA polymerases that possess significantly 
^I mproved properties relative to previously described t hftrm"^^^'^ nM/Tp^iyrv,^^,^^ jhn PM/\ pnlymrrnnr >1nlf1'i *r ih 
stantial improvements when used in DNA sequencing reactions. In particular, the DNA polymerase of the invention pro- 
vides the following combination of advantageous properties: 

• improved incorporation of ddNTPs; 

- improved uniformity of peak heights in DNA sequencing traces, in particular when used with dye-labeled ddNTPs 
in a cycle sequencing reaction; 

- reduced rate of pyrophosphorolysis of dye-labeled ddNTPs; and 
improved incorporation of dITR 

Furthermore, the DNA polymerase can be easily and efficiently expressed to a high level in a recombinant expression 
system, thereby facilitating commercial production of the enzyme. The combination of properties possessed by the 
DNA polymerase of the present invention represent a significant advantage over thermostable DNA polymerases pre- 
viously described in the literature. 

Jhe chimeric DNA polymerases of the p^ ent in vention consist of an N-terminal region derived from the 5'-nucle - 
ase domain of a Thermus species DNA polymerase ana a C-terminal reqinn f^prf"*^ ^'"""^ \n 5, exonuclease an? 
.PQ l ymer a sp Joa iains of Tma DNA polymerase. Th e N-terminal region contains at least a region of the Thermus species 
DNA polymerase corresponding to amino acids 1-138 of Tma DNA polymerase and may contain up to the entire 5'- 
nudease domain of the Thermus species DNA polymerase. The C-terminal region contains, in addition to the 3' -to 5'- 
exonuclease and polymerase domains of Tma DNA polymerase, a portion of the 5'-nuclease domain of Tma DNA 
polymerase corresponding to the portion of the 5'-nuclease domain of Thermus species DNA polymerase not present 
in the N-terminal region. 

Thus, the chimeric DNA polymerase of the present invention consists of an N-terminal region and a C-terminal 
region, wherein said N-terminal region consists of amino acids 1 through n of a Thermus species DNA polymerase, 
wherein n is an amino acid position within a region of the Thermus species DNA polymerase corresponding to amino 
acids 1 38-291 of Tma DNA polymerase, and wherein said C-terminal region consists of amino acids m+1 through 893 
of Tma DNA polymerase, wherein amino acid m in Tma DNA polymerase corresponds to amino acid n in the Thermus 
species DNA polymerase when Tma DNA polymerase and the Thermus species DNA polymerase are aligned as in the 
figures. 

The chimeric DNA polymerase of the present invention is modified by a F730Y mutation in tiie DNA polymerase 
domain derived from Tma DNA polymerase, which increases the ability of the DNA polymerase to incorporate dideox- 
ynucleotides. 

In one embodiment, the 5'-nuclease domain of the chimeric DNA polymerase contains at least one point mutation 
that substantially reduces or, preferably, inactivates the 5'-nuclease activity. The mutation can be present either in the 
N-terminal, which is derived from the 5'-nuclease domain of the Thermus species DNA polymerase, or the portion of 
the C-terminal region that is derived from 5'-nuclease domain of Tma DNA polymerase, if present. Suitable mutations 
are those point mutations (single amino acid substitution or deletion mutations) that substantially reduce or, preferably, 
inactivate tfie 5'-nuclease activity in the source DNA polymerase. Thus, either the N-terminal region is modified by at 
least one amino acid substitution or deletion that substantially reduces or eliminates 5'-nuclease activity in the Thermus 
species DNA polymerase, or said C-terminal region is modified by at least one amino acid substitution or deletion within 
the region that is amino acids m+1 to 291 of Tma DNA polymerase that substantially reduces or eliminates 5-nuclease 
activity in Tma DNA polymerase. 

Amino acid positions which are critical to the 5'-nuclease activity of a DNA polymerase are well known, as 
desCTlbed herein, A substitution of an amino acid at one or more of these critical positions or a deletion of an amino acid 
at one or more of these critical positions typically results in a decrease in the 5'-nuclease activity. Preferably, the chi- 
meric DNA polymerase contains a mutation tiiat substantially reduces or inactivates the 5'-nuclease activity. 

In one embodiment, the C-terminal region, which contains the 3'- to 5'- exonuclease domain derived from Tma 
DNA polymerase, contains at least one point mutation that substantially reduces or, preferably, inactivates the 3' to 5' 
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exonuclease activity in Tma DNA polymerase. 

Amino acid positions whicin are critical to the 3' to 5' exonuclease actidty of a DNA polymerase are well known, as 
described herein. A substitution o1 an amino acid at one or more of these critical positions or a deletion of an amino acid 
at one or more of these critical positions typically results in a decrease In the 3'- to 5'-nuclease activity. In a preferred 
5 embodiment, the C-terminal region contains a D323A and a E325A mutation, which inactivate the 3* to 5, exonuclease 
activity. 

In one embodiment, the N-terminal region is derived from Taq DNA polymerase. In a preferred embodiment, the N- 
terminal region consists of amino acids 1-190 of Taq DNA polymerase, and the C-terminal region consists of amino 
acids 191-893 of Tma DNA polymerase. In a particular preferred embodiment, designated F730Yr/T?a30 DNA 
10 Polymerase, the N-terminal region consists of amino acids 1-190 of Taq DNA polymerase and contains a G46D muta- 
tion, and the C-terminal region consists of amino acids 191-893 of Tma DNA polymerase and contains D323A, E325A, 
and F730Y mutations. 

Another aspect of the present invention relates to the purified DNA (chimeric gene) which encodes the mutant, chi- 
meric thermostable DNA polymerase of the invention, recombinant DNA vectors which contain the DNA, and host cells 
15 transformed with the recombinant DNA vectors. DNA sequences which differ only by silent nucleotide changes (i.e., 
which encode the same amino acid sequence) are within the intended scope of the invention. 

In a prefen'ed embodiment of the invention, the purified DNA consists of nucleotides 1-570 of a gene encoding Taq 
DNA polymerase modified to encode the G46D mutation, and nucleotides 571-2679 of a gene encoding Tma DNA 
polymerase modified to encode the D323A. E325A. and F730Y mutations. 
20 Another aspect of the invention relates to methods for preparing the mutant, chimeric thermostable DNA polymer- 
ase of the invention using the purified DNA of the present invention. A recombinant expression vector is expressed in a 
host cell, and the expressed protein is purified from the host cell. 

Brief Description of t he Drawing s 

26 

Figures 1 A and 1 B provide an amino acid sequence alignment of the 5'-nuclease domains of Tma DNA polymer- 
ase and DNA polymerases from seven species of the genus Thermus. Amino acids which are aitical to the 5'- 
nuclease activity are indicated by asterisl<s. 

Figures 2A, 2B, and 2C provide a sequendng trace from the cycle sequencing reaction using F730Yrma30 DNA 
30 Polymerase as described in Example 5. 

Figures 3A, 3B, and 3C provide a sequencing trace from the cycle sequencing reaction using AmpliTaq® DNA 
Polymerase FS as described in Example 5. 

Detailed Description of the Invention 

35 

The present invention provides a mutant chimeric thermostable DNA polymerase and means for producing the 
enzyme. To facilitate understanding of the invention, a number of terms are defined below. 

The terms "cell", "cell line", and "cell culture" can be used interchangeably and all such designations include prog- 
eny Thus, the words "transformants" or "transformed cells" include the primary transformed cell and cultures derived 
40 from that cell without regard to the number of transfers. All progeny may not be precisely identical in DNA content, due 
to deliberate or inadvertent mutations. Mutant progeny that have the same functionality as screened for in the originally 
transformed cell are included in the definition of transformants. 

The term "control sequences'* refers to DNA sequences necessary for the expression of an operably linked coding 
sequence in a particular host organism. The control sequences that are suitable for procaryotes, for exanple, include 
45 a promoter, optionally an operator sequence, a ribosome binding site, positive retroregulatory elements (see U.S. Pat- 
ent No. 4,666,848, incorporated herein by reference), and possibly other sequences. Eucaryotic cells are known to uti- 
lize promoters, polyadenylation signals, and enhancers. 

The term "expression clone" refers to DNA sequences containing a desired ccxling sequence and control 
sequences in operable linkage, so that hosts transformed with these sequences are capable of producing the encoded 
50 proteins. The term "expression system" refers to a host transformed with an expression clone. To effect transformation, 
the expression clone may be included on a vector; however, the relevant DNA may also be integrated into the host chro- 
mosome. 

The term "gene" refers to a DNA sequence that comprises control arxJ coding sequences necessary for the pro- 
duction of a recoverable bioactive polypeptide or precursor. 
55 The term "operably linked" refers to the positioning of the coding sequence such that control sequences will func- 
tion to drive expression of the protein encoded by the coding sequence. Thus, a coding sequence "operably linked" to 
control sequences refers to a configuration wherein the coding sequences can be expressed under the direction of a 
control sequence. 
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The term "oligonucleotide" as used herein is d^ined as a molecule comprised of two or more deoxyribonucleotides 
or ribonucleotides. The exact size will depend on many factors, which in turn depends on the ultimate function or use 
of the oligonucleotide. Oligonucleotides can be prepared by any suitable method, including, for example, cloning and 
restriction of appropriate sequences and direct chemical synthesis by a method such as the phosphotriester method of 
Narang et aL, 1979, Meth. Enzvmol . 68:90-99; the phosphodi ester method of Brown et al., 1979, Meth . Enzvmol . 
68:109-151; the diethylphosphoramtdite method of Beaucage et aL, 1981, Tetrahedron Lett . 22:1859-1862; and the 
solid support method of U.S. Patent No, 4,458,066, each incorporated herein by reference. A review of synthesis meth- 
ods is provided in Goodchiid, 1990, BlQCPniugategliamisti:^l(3): 165-187. incorporated herein by reference. 

The term "primer" as used herein refers to an oligonucleotide which Is capable of acting as a point of initiation of 
synthesis when placed under conditions in which primer extension is initiated. Synthesis of a primer extension product 
which is complementary to a nucleic acid strand is Initiated in the presence of the requisite four different nucleoside tri- 
phosphates and a thermostable DNA polymerase in an appropriate buffer at a suitable temperature. A "buffer" Includes 
cofactors (such as divalent metal ions) and salt (to provide the appropriate ionic strength), adjusted to the desired pH. 

A primer that hybridizes to the non-coding strand of a gene sequence (equivalerttly is a subsequence of the coding 
strand) is referred to herein as an "upstream" primer A primer that hybridizes to the coding strand of a gene sequence 
is referred to herein as an "downstream" primer. 

The terms "restriction endonucleases" and "restriction enzymes" refer to enzymes, typically bacterial in origin, 
which cut double-stranded DNA at or near a specific nucleotide sequence. 

The term "thermostable enzyme", as used herein, refers to an enzyme which is stable to heat and has an elevated 
temperature reaction optimum. The thermostable enzyme of the present invention catalyzes primer extension optimally 
at a temperature between 60 and 90^*0, and is usable under the temperature cycling conditions typically used in cycle 
sequence reactions and polymerase chain reaction amplifications (described in U.S. Patent No. 4,965,188, incorpo- 
rated herein by reference). 

As used herein, a "point mutation" in an amino acid sequence refers to either a single amino acid substitution or 
single amino acid d^etion. A point mutation preferaWy is introduced into an amino acid sequence by a suitable codon 
change in the encoding DNA. 

Individual amino acids in a sequence are represented herein as AN, wherein A is the standard one letter symbol 
for the amino acid in the sequence, and N is the position in the sequence. Mutations within an amino acid sequence are 
represented herein as Ai NA2, wherein A-i is the standard one tetter symbol for the amino acid in the unmutated protein 
sequence, A2 is the standard one letter symbol for the amino acid in the mutated protein sequence, and N is the position 
in the amino acid sequence. For example, a Q46D mutation represents a change from glycine to aspartic acid at amino 
acid position 46. The amino acid positions are numbered based on the full-length sequence of the protein from which 
the region encompassing the mutation is derived. Thus, in the present invention, mutations in the region of the protein 
which are derived from a Thermus species DNA polymerase are numbered according to the full-length Thermus spe- 
cies DNA polymerase sequence, whereas mutations in the region derived from Tma DNA polymerase are nurT4}ered 
according to the full-length Tma DNA polymerase sequence. Representations of nucleotides and point mutations in 
DNA sequences are analogous. 

As used herein, a "chimeric" protein refers to a protein whose amino acid sequence represents a fusion product of 
subsequences of the amirro acid sequences from at least two distinct proteins. A chimeric protein preferably is not pro- 
duced by direct manipulation of amino acid sequences, but, rather, is expressed from a "chimeric" gene that encodes 
the chimeric amino add sequence. The chimeric proteins of the present Invention consist of an amino-terminal (N-ter- 
minal) region derived from a Thermus species DNA polymerase and a carboxy-terminal (C-terminal) region derived 
from Tma DNA polymerase. The N-terminal region refers to a region extending from the N-terminus (amino acid posi- 
tion 1 ) to an internal amino acid. Similarly, the C-terminal region refers to a region extending from an internal amino acid 
to the C-terminus. In the chimeric proteins of the present invention, the N-terminal region extends from the N-terminus 
(amino acid position 1) to the beginning of the C-terminal region, which extends to the G-terminus. Thus, taken together, 
the N-terminal and C-terminal regions encompass the entire amino acid sequence. 

The exonucleolytic activities associated with DNA polymerases (3' to 5" exonuclease activity and S'-nudease activ- 
ity, also referred to as 5' to 3' exonuclease activity) and methods of measuring tiiese activities are well known in the art. 
As used herein, an activity is "substantially reduced" if reduced to less than about 20%, preferably to less than about 
10%, and more preferably to less than about 1%, of the activity present in the unmutated enzyme. An activity is "inaci- 
tivated" or "essentially inactivated" if reduced to a level which is negligible for the purpose of the enzyme's typical or pre- 
ferred use. 

The thermos table DNA polymerase of the Invention 

The typical thermostable DNA polymerase of the present invention is a chimeric DNA polymerase in which the N- 
terminal region consists of an N-terminal region of a Thermus species DNA polymerase and the C-terminal region con- 
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sists of a C-terminal region of Tma DNA polymerase. The N-terminal region irom the Thermos species DNA polymer- 
ase encompasses a portion of, or all of, the S'-nudease domain. The C-terminal region from Tma DNA polymerase 
encompasses a portion, or possibly none, of the S'-nudease domain and the entire 3' to 5' exonuclease and DNA 
polymerase domains. The portion of the 5'-nudease domain of Tma DNA polymerase encompassed by the C-terminal 

5 region of the chimeric protein will correspond to that portion of the 5'-nuclease domain of the Thermus species DNA 
polymerase not encompassed by the N-terminal region of the chimeric protein. 

The chimeric DNA polymerase additionally contains the F730Y mutation, which increases the effidency with which 
the DNA polymerase incorporates ddNTPs. The chimeric DNA polymerase preferably also contains one or more point 
mutations which significantly reduce or eliminate the 5'-nudease activity and one or more point mutations which signif- 

10 icantly reduce or eliminate the 3* to 5* exonuclease activity. 

1 ■ The chimeric prot ein domains 

DNA polymerases from species of the genus Thermus and Tma DNA polymerase are similar in overall structure. 
15 In these DNA polymerases, the exonuclease and DNA polymerase activities of the enzymes are present in discrete 
regions of the protein (the activity domains). The approximate activity domains of a representative Thermus species 
DNA polymerase, Taqr DNA polymerase, and Tma DNA polymerase are shown in the table below (see also U.S. Patent 
No. 5,420,029). The homologous activity domains which encode 5 -nuclease activity, and those which encode DNA 
polymerase activity, are approximately the same length (see Figures 1 A and 1 B). The difference in length between the 
^ 20 region that encodes 3* to 5' exonudease activity in Tma DNA polymerase and the corresponding region in Taq DNA 
polymerase corresponds to the lack of 3' to 5' exonuclease activity in Taq DNA polymerase. 



Activity Domains (approximate amino add positions) 




5'-nudeas6 


3'- to 5'exonuclease 


Polymerase 


Taq DNA polymerase 


1-289 




423-832 


Tma DNA polymerase 


1-291 


292-484 


485-893 



Significant amino add sequence similarity exists between Thermus spedes DNA polymerases and Tma DNA 
polymerase. For example, an amino add sequence comparison of a representative Thermus spedes DNA polymerase, 
Taq DNA polymerase, and Tma DNA polymerase using the GAP computer program (Genetics Computer Group, Mad- 

35 ison. Wl) with the default parameter values, indicates that the amino add sequences are approximately 44% identical 
and 66% similar over either the entire amino add sequences or over the 5'-nudease domains. 

Because of the overall structural and sequence similarity the chimeric enzyme preserves the overall structure and 
activity domains present in Tma DNA polymerase. The essential change is that the amino add sequence of the N-ter- 
minal region of the chimeric enzyme is that of the corresponding region of a Thermus species DNA polymerase. Thus, 

40 the chimeric enzyme of the present invention corresponds to a mutated Tma DNA polymerase, wherein the 5'-nudease 
domain has been replaced by the corresponding domain from a Thermus species DNA polymerase. The "correspond- 
ing domain" is defined herein by an amino acid sequence alignment, as provided in the figures. 

Figures 1 A and IB provide an amino acid sequence alignment of the S'-nuclease domains of Tma DNA polymer- 
ase and seven representative Thermus species DNA polymerases. The seven representative Thermus species DNA 

46 polymerases are listed in the table below, along with the abbreviations used herein and the sequence identification 
numbers for the amino acid sequences of the 5'-nuclease domains. 



Abbreviation 


Species 


Sequence of the 5'- 
Nuclease Domain 


Tma 


Thermatoga maritima 


(SEQ ID N0:1) 


Taq 


Thermus aquaticus 


(SEQ ID NO: 2) 


Tfl 


Thermus flavus 


(SEQ ID NO: 3) 


Tth 


Thermus thermophifus 


(SEQ ID NO: 4) 



6 

03/15/2002, EAST Version: 1.03.0002 



EP0892 058 A2 



(continued) 



Abbreviation 


Species 


Sequence of the 5'- 
Nuclease Domain 


TZ05 


Thermus species Z05 


(SEQ ID NO: 5) 


Tea 


Thermus caldofilus 


(SEQ ID NO: 6) 


Tsps17 


Thermus species sps 1 7 


(SEQ ID NO: 7) 


Tfi 


Thermus filiformis 


(SEQ ID NO: 8) 



The correspondence of amino acids and regions within these DNA polymerases is obtained from the amino acid 
sequence alignment. As used herein, amino acids "correspond" if they are aligned in the sequence alignment of Figures 
1A and 1B. Thus, correspondence refers both to amino acids which are identical (conserved) among the sequences 
and to amino acids which are not identical, but which are aligned to maximize overall sequence homology 

A number of additional species of the genus Thermus have been identified and are available from depositories 
such the American Type Culture Collection (ATCC) and the Deutsche Sammlung von Mikroorganismen (DSM). As dis- 
cussed below, DNA polymerases and the encoding genes can be recovered from the deposited strains and sequenced 
in a routine manner. A routine sequence alignment of the amino acid sequence of a Thermus species DNA polymerase 
sequence with the Tma DNA polymerase sequence using, for example, the GAP program, will enable the use of the 
Thermus DNA polymerase sequence in a chimeric DNA polymerase of the present invention. 

In the chimeric protein of the Invention, the first amino acid of the region from Tma DNA polymerase will begin with 
the amino acid following the amino acid that corresponds to the last amino acid of the Thermus species DNA polymer- 
ase sequence and will contain the rest (through amino acid 893) of the Tma DNA polymerase sequence. The sequence 
of the entire Tma DNA polymerase is provided as SEQ ID NO: 10. Preferably, the amino acid sequence from the 
Thermus species DNA polymerase is joined to an amino acid sequence from Tma DNA polymerase at a point where 
the two amino acid sequences are identical or similar. For example, a preferred embodiment consists of amino acids 1- 
190 from Tag DNA polymerase and amino acids 191-893 of Tma DNA polymerase. Amino acid 190 of Tma DNA 
polymerase corresponds to amino acki 190 of Taq DNA polymerase, and the Tma DNA polymerase portion of the chi- 
meric enzyme begins with the next amino acid, amino acid 191 . 

In regions where the two DNA polymerases are identical, identification of the last amino acid from the Thermus 
species DNA polymerase is arbitrary within the region. For example, because amino acids 191 and 192 are identical in 
Taq DNA polymerases and Tma DNA polymerases (and conserved among Thermus species DNA polymerase), a chi- 
meric protein that contains amino acids 1 -1 90 of Taq DNA polymerase is indistinguishable from chimeric proteiris con- 
taining amino acids 1-191 or 1-192 of Taq DNA polymerase. The embodiment of the invention described in the 
exanrples is referred to as containing amino acids 1 -1 90 of Taq DNA polymerase in view of the original derivation of the 
enzyme. 

In the sequence alignment provided in Figures 1A and IB, gaps one amino acid in length were inserted into the 
Tma DNA sequence at positions 54-55 and 225-226 to allow alignment with five of seven of the Thermus species DNA 
polymerases which contain an additional amino acid at these positions. Consequently, for these two amino acids 
present in these five Thermus species, there are no corresponding amino adds in Tma DNA polymerase. One of skill 
in the art will realize that a suitable chimeric protein containing a N-terminal region from one of these five Thermus spe- 
cies DNA polymerases that ends with an amino acid which is aligned with a gap in Tma DNA polymerase can be con- 
structed in which the Tma DNA polymerase-derived region starts at the first amino acid following the gap. 

A critical aspect of the chimeric DNA polymerase is that it is encoded by a chimeric gene in which the region encod- 
ing the Tma DNA polymerase sequence through at least the alternative ribosomal binding site present at about codons 
133-137 in the full-length Tma DNA polymerase gene, and preferably through the methionine 140 start codon, is 
replaced by a gene sequence encoding the corresponding region from a Thermus species DNA polymerase. The pres- 
ence in the full-length Tma DNA polymerase gene of this alternative ribosomal binding site and start codon results in 
the preferential expression of a truncated Tma DNA polymerase starting with amino acid 140. As described below, 
replacement of this region of the Tma DNA polymerase gene is artical to the efficient expression of the full-length chi- 
meric protein. Thus, in the chimeric DNA polymerase of the Invention, the N-terminai region from a Thermus species 
DNA polymerase replaces a region of Tma DNA polymerase that encompasses at least through amino acid 137, and 
preferably through amino acid 1 40. 

The region of each Thermus species DNA polymerase that corresponds to amino acids 1-137 of Tma DNA 
polymerase is obtained from an amino acid sequence alignment, as provided in the figures. For example, the region of 
Taq DNA polymerase that corresponds to amino acids 1-137 of Tma DNA polymerase Is amino acids 1-142 (see Fig- 
ures 1 A and 1 8), and the amino acid of Taq DNA polymerase that corresponds M140 of Tma DNA polymerase is L145. 
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Thus, embodiments in which the N-terminal region is from Taq DNA polymerase will comprise at least amino acids 1- 
142 and preferably, amino acids 1-145 of Taq DNA polymerase. Similarly, for embodiments in wNch the N-terminal 
region is from another Thermus species DNA polymerase, the region of the Thermus species DNA polymerase that 
corresponds to amino acids 1 -137 and 1 40 of Tma DNA polymerase is obtained from the sequence alignment provided 
in the figures. 

One of skill in the art will recognize that minor mutations, additions, or deletions can be introduced into a DNA 
polymerase that do not alter the functional properties of the enzyme, and that such a mutated enzyme is equivalent, for 
all intents and purposes, to the unmutated enzyme. For example, it is known that a deletion in Taq DNA polymerase of 
several N-terminal amino acids does not alter the functional properties of the enzyme. Similarly, it is known that substi- 
tution mutations at many of the amino acid positions appear to have essentially no affect. For the purposes of the 
present invention, DNA polymerases which contain minor mutations that do not alter the functional properties of the 
enzyme are considered to be equivalent to the unmutated DNA polymerase, 

2. Point mutations in th e S'-nuclease domain 

In one embodiment, the 5'-nuclease domain of the chimeric DNA polymerase contains one or more point mutations 
(single amino acid substitution or deletion mutations) which reduce or eliminate the 5*-nuclease activity. Because the S'- 
nuclease domain of the chimeric protein contains portions derived from a Thermus species DNA polymerase and, in 
most embodiments, from Tma DNA polymerase, mutations which substantially reduce or eliminate the S'-nuclease 
activity may be introduced either into the Thermus species DNA polymerase-derived portion or the Tma DNA polymer- 
ase-derived portion. 

Based on amino acid sequence alignments, DNA polymerases have been classified into groups, designated fami- 
lies A, B. and C, according to the homology with E coli DNA polymerases I. II, and III (see, for example, Ito and 
Braithwaite, NucL Asids BfiS. 19(15):4045-4-47, incorporated herein by reference). The Tma and Thermus species 
DNA polymerases are members of the family A DNA polymerases, which are related to E. coli DNA polymerase I. 
Amino acids which are conserved among family A DNA polymerases and which are critical to S'-nuclease activity of the 
DNA polymerases have been identified (see, for example, Gutman etal. 1993, Nucl. Acids. Res. 21:4406-4407, incor- 
porated herein by reference). Because of the conservation of amino acids which are critical for S'-nuclease activity in 
family A DNA polymerases, the identification of critical amino ackJs in one DNA polymerase, such as £ coli DNA 
polymerase I or Taq DNA polymerase, allows identification of critical amino acids in other family A DNA polymerases 
based on a sequence alignment, such as provided in Figures 1 A and 1 B. Critical amino acids can be identified in addi- 
tional Thermus species DNA polymerases from a routine sequence alignment with the sequences provided herein. 

Amino acids that have been identified as critical to S'-nuclease activity are indicated in Figures 1 A and 1 B with an 
asterisk. The positions of the critical amino acids within each DNA polymerase is obtained from the alignment. For 
example, referring the Taq DNA polymerase sequence, (SEQ ID NO: 2). these critical amino acids are as follows: D18 
R25, G46, D67, F73. R74, Y81. Q107, E117. D119. D120. D142. D144. Q187, D188. D191, and Q195. 

it would not be surprising if additional critical amino acids are identified in the future. As mutations at these amino 
acid positions as described herein would result in a reduction or eliminating of the S'-nuclease activity, such mutations 
would be suitable for use in the present invention. 

In general, to reduce or eliminate S'-nuclease activity, one or more of these amino acid positions is either deleted 
or mutated to an amino acid having a different property. For example, an acidic amino acid such as Asp (D) may be 
changed to a basic (Lys. Arg, His), neutral (Ala, Val, Leu. lie. Pro, Met. Phe. Trp), or polar but uncharged amino acid 
(Gly, Ser, Thr, Cys, Tyr, Asn, or Gin). The preferred G46D mutation substitutes the acidic Asp for the polar but 
uncharged Gly. In general, mutations to Ala or Gly are preferable to minimize distortion of the protein structure. 

Substitution mutations which preserve the charge property of the amino acid also may attenuate tiie S'-nuclease 
activity. For example, U.S. Patent S,474,920, incoT3orated herein by reference, desaibes three mutations in the Taq 
DNA sequence which reduce or eliminate the S'-nuclease activity. Although one of the mutations. R2SC (basic to polar 
but uncharged), results in a change to an amino acid having a different property, two of the mutations: F73L (neutral to 
neuti*a!) and R74H (basic to basic), do not result in a change in property. Nevertheless, all three mutations attenuate the 
S'-nuclease activity. Particular mutations at each critical amino acid position which affect the S'-nuclease activity can be 
determined routinely by mutating the DNA polymerase and measuring the resulting activity A sensitive and convenient 
assay is described in U.S. Patent S.466,591, incorporated herein by reference. 

In a preferred embodiment, tiie S'-nuclease domain of the chimeric DNA polymerase contains a mutation corre- 
sponding to a G46D mutation in Taq DNA polymerase, which reduces the S'-nuclease activity at least 1000-fbId (see 
U.S. Patent 5,466.S91). 

Mutations in the amino acid sequence are achieved by incorporating appropriate mutations in the encoding gene 
sequence. Such mutations in the DNA sequence are carried out using techniques well known in ttie art, as described 
further, below. 
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3. Point mutations in the 3' to 5' exonudease domain 

In one embodiment, the 3' to 5' exonudease domain of the chimeric DNA polymerase contains one or more point 
mutations (single amino acid substitution or deletion mutations) which reduce or eliminate the 3' to 5' exonudease activ- 
ity. The 3' to 5' exonudease domain of the chimeric protein is contained within the Tma DNA polymerase-derived por- 
tion. Thus, suitable mutations are those which substantially reduce or eliminate the 5'-nuclease activity of Tma DNA 
polymerase. 

Three amino acid "motifs" critical for 3' to 5* exonudease activity in Tma DNA polymerase, along with the critical 
amino acids within each motif, have been identified (see U.S. Patent No. 5,420,029). The critical amino adds are listed 
below. Mutations of one or more of these amino acids which reduce the 3' to 5' exonudease activity in Tma DNA 
polymerase may be used in the DNA polymerases of the present Invention. 



Tma DNA polymerase Amino Acids 


Critical to 3' to 5' exonudease Activity 


Motif 


Critical Amino acids 


A 


D323, E325, L329 


B 


N385, D389, L393 


C 


Y464, D468 



It would not be surprising if additional critical amino acids are identified in the future. As mutations at these amino 
acid positions as desaibed herein would result in a reduction or eliminating of the 3' to 5* exonudease activity, such 
mutations would be suitable for use in the present invention. 

As described above for the reduction of 5 -nudease activity, reduction or elimination of 3' to 5* exonudease activity 
is achieved by a substitution or deletion mutation at one or more of these critical amino add positions, preferably a sub- 
stitution mutation to an amino add having a different property. In the preferred embodiment, the 3* to 5* exonudease 
domain of Tma DNA polymerase is mutated by D323A and E325A mutations, which together essentially eliminate the 
3' to 5' exonudease activity. 

Mutations in the amino acid sequence are achieved by incorporating appropriate mutations in the encoding gene 
sequence. Such mutations in the DNA sequence are carried out using techniques well known in the art, as described 
further below. 

Advantages of the DNA polymerase of the invention 

The chimeric thermostable DNA polymerase of the invention represents a significant improvement over thermosta- 
ble DNA polymerases described in the literature. In particular, the DNA pdymerase of the invention provides the follow- 
ing combination of properties: i 
improved incorporation of ddNTPs; 

- improved uniformity of peak heights in DNA sequencing traces, in particular when used with dye-labeled ddNTPs 
in a cyde sequendng reaction; 

- reduced rate of pyrophosphorolysis of dye-labeled ddNTPs; and 

- improved incorporation of dITP. 

- Furthermore, the DNA polymerase can be easily and efficiently expressed to a high level in a recombinant expres- 
sion system, thereby facilitating commercial production of the enzyme. 

The combination of properties possessed by the DNA polymerase of the invention is particularly useful in dye-ter- 
minator cyde sequencing reactions, and provides significantly improved results. Each of these properties is discussed 
below. 

1. Improved incorporation of ddNTPs 

The chimeric DNA polymerase of the present invention contains the F730Y mutation, which is known to increase 
the efficiency of incorporation of ddNTPs. 
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By comparison, AmpliTaq® DNA polymerase FS is a mutated form of Tag DNA polymerase that contains the anal- 
ogous mutation (F667Y). AmpliTaq® DNA polymerase FS also exhibits an increased efficiency of incorporation of 
ddNTPs, but lacks several the other properties exhibited by the DNA polymerase of the present invention. 

2. Improved uniformity of peak heights in DNA seouencina traces 

An advantageous property of the DNA polymerase of the present invention is that when used in a dye-terminator 
cyde sequencing reaction, rt results in uniform peak heights in the sequencing trace (also referred to as chromatograms 
or electropherograms). Uneven peak heights can decrease the accuracy of base calling and make mutation and poly- 
morphism detection more difficult. 

Unevenness of peak heights in dye-terminator cyde sequendng reactions is a problem that previously had not 
been solved. For example, although AmpliTaq® DNA Polymerase FS incorporates ddNTPs more effidently than does 
unmutated Taq DNA polymerase, the peak height patterns obtained in dye-terminator sequendng reactions are uneven 
(see Parker ef a/., 1996. BioTechniques £l(4):694-699, incorporated herein by reference). The unevenness results at 
least partially from a dependence of peak height on the sequence context. For example, the peak height obtained from 
a G following an A can be extremely small, making an accurate base call difficult. Conversely, the peak height obtained 
from an A following an G can be very high. Particularly problematical patterns include G after A or C, A after A or C, and 
T after T, which can result in very low peak heights. Very high peak heights, such as results from A after G, are less 
problematical alone, but can render adjacent low signals unreadable. 

As shown in the examples, the use of the chimeric DNA polymerase of the invention in cyde sequencing reactions 
results in significantly more uniform peak heights than obtained using AmpliTaq® DNA Polymerase FS. The improved 
uniformity in peak height results in a significant increase in the accuracy of base calling and niakes mutation and poly- 
morphism detection easier. 

3. Rediiced rate of pyrophosphorolvsis of dve-labeled ddNTPs 

DNA polymerases catalyze the template-dependent incorporation of a deoxynucleotide onto the 3'-hydroxyl termi- 
nus of a primer, with the concomitant release of inorganic pyrophosphate (PPi). This polymerization reaction is revers- 
ible. DNA polymerases also catalyze the reverse reaction, pyrophosphorolysis, which is the degradation of DNA in the 
presence of PPi. The reaction is summarized below: 

DNAn + dNTP <r> DNAp^i + PPi 

Inorganic pyrophosphatase (PPase), also known as pyrophosphate phosphohydrolase, catalyzes hydrolysis of 
inorganic pyrophosphate (PPi) to two molecules of orthophosphate. PPase plays an vital role in RNA and DNA synthe- 
sis in vivo. By cleaving PPi, the enzyme shifts the overall equilibrium in favor of synthesis. 

Pyrophosphorolysis can be detrimental to DNA sequendng reactions. Accuracy in DNA sequencing reactions 
depends on precise band position, a decrease in size of only one nucleotide can result in gel artifacts such as reduced 
or missing bands. Pyrophosphorolysis results in the removal of bases from the 3'- end of the primer extension product. 
Furthermore, removal of the incorporated terminal ddNMP (dideoxynucleosidemonophosphate) from a ddNMP-termi- 
nated fragment allows subsequent extension, which leads to signal strength reduction at the affected position and a 
reduced or missing peak in the electropherogram. 

Thus, it is desirable to minimize the pyrophosphorolysis reaction in sequencing reactions. The addition of PPase to 
the reaction shifts the overall equilibrium in favor of synthesis by cleaving PPi. The use of PPase to improve sequencing 
reactions is described in Tabor and Richardson, 1990, J. Biol. Chem. 265(14):8322-8328; and in PCT Patent Publica- 
tion No. WO 90/1 2111; both incorporated herein by reference. The commerdally available cycle sequencing kits from 
PerWn Elmer (Nonwalk, CT), which contain AmpliTaq® DNA Polymerase FS, contain PPase to reduce pyrophosphorol- 
ysis. 

Surprisingly, cycle sequencing reactions using the DNA polymerase of the present Invention are much less affected 
by pyrophosphorolysis of the dye-labeled ddNTP terminators. As described in the examples, cyde sequencing reac- 
tions carried out with a range of PPase concentrations from 0 to 20 units yielded essentially identical results. Thus, the 
DNA polymerase of the present invention appears to greatiy reduce or eliminate the need for PPase in cycle sequenc- 
ing reactions. 

4. Improved incorporation of dITP 

In a typical cycle sequencing reaction, dITP is used instead of dGTP in order to relieve compressions in G/C-rich 
regions. Incorporation of dITP into DNA reduces the denaturation temperature and facilitates denaturation of secondary 



10 

03/15/2002, EAST Version: 1.03,0002 



EP0 892 058 A2 



structure. Because DNA polymerases discriminate against dlTR which is an unconventional nucleotide, the relative 
concentration of dITP must be substantially increased in a reaction to obtain adequate incorporation. For example, in 
the reaction conditions optimized for AmpllTaq® DNA Polymerase FS, dITP is present at a concentration five-fold 
greater than the concentrations of dATP, dCTR and dTTP. 
s In contrast, the DNA polymerase of the present invention incorporates dITP more efficiently, which allows the reac- 
tion to be carried out with more uniform dNTP concentrations. As described in the examples, a dITP concentration of 
only about two- to three-fold greater than the concentrations of dATP, dCTP. and dTTP is optimal for the DNA polymer- 
ase of the present invention. 

10 5. Efficiency of expression 

As described above, the chimeric enzyme of the present invention corresponds to a mutated Tma DNA polymer- 
ase, wherein the 5 -nuclease domain has been replaced by the corresponding domain from a Therms species DNA 
polymerase. The enzyme is expressed from a chimeric gene which corresponds to a mutated Tma DNA polymerase 
15 gene, wherein the region of the gene that encodes the 5'-nuclease domain has been replaced by the corresponding 
region of the Thermus species DNA polymerase gene. A significant advantage of the chimeric gene is that it enables 
the expression of a full-length DNA polymerase in a recombinant expression system much more efficiently than is pos- 
sible from the Tma DNA polymerase gene. 

The expression of a full-length DNA polymerase from a recombinant expression system containing the full-length 
20 natural Tma DNA polymerase gene sequence is problematical because of the preferential expression of a truncated 
form of the protein (see U.S. Patent No. 5,420,029). The truncated protein, referred to as Met140 Tma, consists of 
amino adds 140-893 of the full-length protein and appears to result from translation beginning at the methionine at posi- 
tion 140. The presence of a putative ribosomal binding site at codons 133-137 further suggests that the truncated pro- 
tein results from translation beginning at the internal methionine. The preferential expression of the Met140 Tma 
25 truncated protein represents a significant difficulty in expressing arxl purifying a full-length Tma DNA polymerase. 

The chimeric DNA polymerase gene contains a Thermus species DNA polymerase gene sequence in a region cor- 
responding at least through the alternative ribosomal binding site present at about codons 133-137 in the full-length 
Tma DNA polymerase gene, and preferably through the internal start codon, codon 140. Thus, the Tma DNA polymer- 
ase gene sequence up through the region containing the ribosomal binding site and, preferably, the start codon respon- 
se sible for the translation of Metl 40 Tma, is replaced by the corresponding region of a Thermus species DNA polymerase 
gene. The corresponding region of a Thermus species DNA polymerase gene does not provide for the undesirable 
internal initiation of a truncated protein. As a result, a recombinant expression system containing the chimeric DNA 
polymerase gene expresses a full-length chimeric DNA polymerase exclusively 

35 Preparation of the DNA polymerase of the inventinn 

The DNA polymerase of invention is a chimeric enzyme that consists of a portion derived from a Thermus species 
DNA polymerase and a portion derived from ri77a DNA polymerase. The chimeric enzyme is prepared from a chimeric 
gene, i.e., a DNA that encodes the chimeric enzyme and consists of a portion derived from the Thermus species DNA 

40 polymerase gene and a portion derived from the Tma DNA polymerase gene. The chimeric gene is produced from the 
Thermus species DNA polymerase gene and the Tma DNA polymerase gene using standard gene manipulatfon tech- 
niques well known in the field of molecular biology, as described in detail below. 

The gene encoding Tma DNA polymerase is described in U.S. Patent Nos. 5,420,029 and 5,466,591. The nucle- 
otide sequence of the Tma DNA polymerase gene, as well as the full amino acid sequence of the encoded protein, are 

45 described therein. Example 5 of the '029 patent describes the construction of a variety of plasmids containing the full- 
length Tma DNA polymerase gene starting with plasmids pTmaOl (deposited as Escherichia coli DG101, 
pBSM:TmaXma7-1 under ATCC No. 68471 on November 7, 1990; redeposrted as ATCC No. 98764 on May 22, 1998) 
and pTma04 (deposited as Escherichia coli DG101. pBSM:TmaXma1M delta Ba/Bgl under ATCC No. 68472 on 
November 7, 1990; redeposited as ATCC No. 98765 on May 22. 1998), such as plasmids pTma12-1 and pTmalS. Any 

50 of these expression vectors is suitable as a source of the Tma DNA polymerase gene. 

Genes encoding DNA polymerases from a number of Thermus species, including the nucleotide sequence of the 
DNA polymerase gene and the amino acid sequence of the encoded protein, have been described. A nuntDer of these 
genes are obtainable from publicly available plasmids. The genes from additfonal Thermus species are obtainable from 
the host organisms using methods described in U.S. Patent Nos. 5,079,352; 5,618.711; 5.455.170; 5.405,774; and 

55 5,466,591 ; each incorporated by reference. 

The gene encoding Taq DNA polymerase is described in U.S. Patent Nos. 5,079,352 and 5,466,591. The nucle- 
otide sequence of the Taq DNA polymerase gene, as well as the full amino acid sequence of the encoded fxotein, are 
described therein. Examples V-VII of the *352 patent describes the construction of a variety of expression plasmids con- 
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taining the full-length Taq DNA polymerase gene starting with plasmids pFC83 (ATCC 67422 deposited on May 29. 
1987; redeposited as ATCC No. 98763 on May 22, 1998) and pFC85 (ATCC 67421 deposited on May 29. 1987; rede- 
posited as ATCC No. 98762 on May 22, 1998), such as plasmids pLSPI , pLSG2. pSYCl578, pLSG5, and pLSG6. Any 
of these expression vectors is suitable as a source of the Taq DNA polymerase gene. 

The gene encoding Tih DNA polymerase, methods for obtaining the gene, and expression plasmids containing the 
gene are described in U.S. Patent No. 5,618,711 and 5,466,591. 

The gene encoding TZ05 DNA polymerase, methods for obtaining the gene, and expression plasmids containing 
the gene are described in U.S. Patent No. 5,455,170 and 5,466,591. 

The gene encoding Tsps 7 7 DNA polymerase, methods tor obtaining the gene, and expression plasmids containing 
the gene are described In U.S. Patent No. 5,405,774 and 5,466.591 . 

The Tfl DNA polymerase gene is described in Akhmetzjanov and Vakhitov, 1992, Nucleic Acids Research 
20(21):5839, incorporated herein by reference. 

The Tfi DNA polymerase gene can be recovered from ATCC 43280 using the methods described in the referenced 
patents (see also 1984, FEMS Microbiol. Lett. 22:149-153 (1984)}. 

The Tea DNA polymerase gene is described in Kwon, 1997, MoL Cells 7(2): 264-271 ; and the nucleotide sequence 
is available under EMBLyGenBank Accession No. U62584. 

Additional Thermus species DNA polymerase genes can be recovered using techniques desaibed in the above 
cited patents from the following ATCC deposits: ATCC 4381 4 and 4381 5 (see Alfredsson, 1 986, Ap pl. Environ. Micro- 
bioL 52:1313-1316): ATCC 27978 (see Ramaley 1970 i Bacteriol. 114:556-562; 1973; ibid. 103:527-528); ATCC 
31 674 (see U.S. Patent Nos. 4,442.21 4 and 4,480,036); ATCC 35948 (7: ruber, see Loginova 1 984, Jni i Syst. Bacte- 
iM- 34:498-499). All references are incorporated herein by reference. 

Additional Tiiermus species can be recovered using techniques described in the above cited patents from the fol- 
lowing Deutsche Sammlung von Mikroorganismen (DSM) deposits: DSM:1279 (NUM: 2244) (see Loginova, et a!., 
1975. Izv. Akad. Nauk SSSR Ser. Biol.: 304-307); DSM:579; DSM:625 (NUM: 2248) (see Degryse etaL, 1978. Arch! 
MigrgbiQl, iaS:196): DSM: 1279 (NUM: 3844) (see Loginova et aL, 1984, JdL i SiSL SaetedQL:498-499); and 
DSM:625(NUM: 1002) (see Brock and Freeze, 1969. J. Bacteriol .: 289-297). All references are incorporated herein by 
reference. 

Additional Tiiermus species which have been described include I oshimai (see Williams et ai., 1 996, \nL iL Syst. 
Bacteriol, 46(2):403-408); I silvanus and T chliarophilus (see Tenreiro et al. 1995, JdL J. Syst. Bacteriol. 45(4) :633- 
639); T. scotoductus (see Tenreiro a/., 1995. BfiS. Microbiol. 146(4):315-324): and I ruber (see Shadrina et al., 
1982, Mikrobiolooiia 51f4):611-615): all incorporated herein by reference. 

Following the guidance provided herein, and using only well known techniques, one skilled in the art will be able to 
prepare from the DNA polymerase genes any number of expression vectors containing a chimeric gene suitable for 
expressing the chimeric DNA polymerases of the invention in any of a variety of host systems. 

In a preferred embodiment, the chimeric enzyme of the invention consists of amino acids 1-190 from Taq DNA 
polymerase and amino acids 191-893 from Tma DNA polymerase, both regions suitably mutated to eliminate associ- 
ated exonuclease activity. This preferred embodiment can be constructed directly from the Taq DNA polymerase and 
Tma DNA polymerase genes, either obtained from the deposited plasmids described above or recovered from the host 
organisms. However, such chimeric DNA polymerases can be most easily constructed from plasmid pUC18:Tma25, 
which was deposited with the ATCC under accession No. 98443 on May 28, 1997. 

Plasmid pUCi 8:Tma25 contains a chimeric gene that encodes a chimeric protein consisting of amino acids 1 -1 90 
from Taq DNA polymerase and amino acids 191-893 of Tma DNA polymerase. The chimeric protein encoded by 
pUC18:Tma25 contains the G46D mutation in the Taq DNA polymerase portion. The nucleotide sequence of the chi- 
meric gene of pUC18:Tma25 is provided as SEQ ID NO: 9. 

Suitable expression systems are constructed from pUC18:Tma25 by sub-cloning the chimeric gene into a suitable 
expression vector, introducing one or more point mutations which attenuate or eliminate the 3' to 5' exonuclease activity 
of the encoded protein, and introducing the F730Y mutation in the Tma DNA polymerase portion. The construction of 
a preferred expression system, which encodes a chimeric protein containing a G46D mutat'on in 5'-nuclease domain, 
D323A and E325A mutations in the 3' to 5' exonuclease domain, and a F730Y mutation in the Tma DNA polymerase 
portion, is described in the examples. 

The nucleotide sequence of pUC18:Tma25 that encodes amino acids 1-190 of Taq DNA polymerase was derived 
from plasmid pRDA3-2, described in U.S. Patent No. 5,466,591 . and, thus, encodes an amino acid sequence containing 
the G46D mutation described therein. The nucleotide sequence of pRDA3-2 and. hence, pUC18:Tma25, also contains 
additional mutations relative to the native Taq DNA polymerase gene sequence (SEQ ID NO: 9) which are silent, i.e., 
do not alter the amino acid sequence encoded. 

Because of the redundancy in the genetic code, typically a large number of DNA sequences encode any given 
amino acid sequence and are, in this sense, equivalent. As described below, it may be desirable to select one or 
another equivalent DNA sequences for use in a expression vector, based on the prefen-ed codon usage of the host cell 
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into which the expression vector will be inserted. The present invention Is Intended to enoonpass all DNA sequences 
which encode the chimeric enzyme. Thus, cNmeric genes of the present invention are not limited to containing only 
sequences from the wild-type Thermus species and Tma DNA polymerase genes, but can contain any of the DNA 
sequences which encode a chimeric DNA polymerase of the present invention. 

Production off the enzyme of the invention Is carried out using a recombinant expression clone. The construction of 
the recombinant expression clone, the transformation of a host cell with the expression clone, and the culture of the 
transformed host cell under conditions which promote expression, can be carrlaJ out in a variety of ways using tech- 
niques of molecular biology well understood in the art. Methods for each of these steps are described in general below. 
Preferred methods are described in detail in the examples. 

An operable expression clone Is constructed by placing the coding sequence In operable linkage with a suitable 
control sequences in an expression vector. The vector can be designed to replicate autonomously in the host cell or to 
Integrate Into the chromosomal DNA of the host cell. The resulting clone Is used to transform a suitable host and the 
transformed host is cultured under conditions suitable for expression of the coding sequence. The expressed protein is 
isolated from the medium or from the cells, although recovery and purification off the protein may not be necessary in 
some Instances. 

Construction of suitable clones containing the coding sequence and a suitable control sequence employs standard 
ligation and restriction techniques that are well understood in the art. In general, isolated plasmids, DNA sequences, or 
synthesized oligonucleotides are cleaved, modified, and religated in the form desired. Suitable restriction sites can, if 
not normally available, be added to the ends of the coding sequence so as to facilitate construction of an expression 
clone. 

Site-specific DNA cleavage is performed by treating with a suitable restriction enzyme (or enzymes) under condi- 
tions that are generally understood in the art and specified by the manufacturers of commercially available restriction 
enzymes. See, e.g.. product catalogs from Amersham (Arlington Heights, IL), Boehringer Mannheim (Indianapolis, IN), 
and New England Biolabs (Beverly. MA). In general, about 1 ^ig of plasmid or other DNA is cleaved by one unit of 
enzyme in about 20pJ of buffer solution; in the examples below, an excess of restriction enzyme is generally used to 
ensure complete digestion of the DNA. Incubation times of about one to two hours at a temperature which is optimal for 
the particular enzyme are typical. After each incubation, protein is removed by extraction with phenol and chloroform; 
this extraction can be followed by ether extraction and recovery of the DNA from aqueous fractions by precipitation with 
ethanol. If desired, size separation of the cleaved fragments may be performed by polyacrylamide gel or agarose gel 
electrophoresis using standard techniques. See, e.g., Maxam et al.. Methods in Enzvmoiogy. 1980, 65:499-560. 

Restriction-cleaved fragments with single-strand "overhanging" termini can be made blunt-ended (double-strand 
ends) by treating with tiie large fragment off E. coli DNA polymerase I (Klenow) in the presence of tiie four deoxynucle- 
oside triphosphates (dNTPs) using incubation times of about 15 to 25 minutes at 20°C to 25^C in 50 mM Tris. pH 7.6, 
50 mM NaCI, 10 mM MgCl2, 10 mM DTT. and 5to 10 ^M dNTPs. The Klenow fragment fills in at 5' protruding ends, but 
chews back protruding 3' single strands, even though the four dNTPs are present. If desired, selective repair can be per- 
formed by supplying one or more selected dNTPs, within the limitations dictated by tiie nature of tiie protruding ends. 
After treatment with Klenow. tfie mixture is extracted with phenol/chloroform and etiianol precipitated. Similar results 
can be achieved using SI nuclease, because treatment under appropriate conditions with SI nuclease results in 
hydrolysis of any single-stranded portion of a nucleic acid. 

Ligations are performed In 1 5-30 ^1 volumes under the following standard conditions and temperatures: 20 mM Tr is- 
Cl, pH 7.5. 10 mM MgCIa, 10 mM DTT, 33 ^ig/ml BSA, 10-50 mM NaCI. and either 40 \iM ATP and 0.01-0.02 (Weiss) 
units T4 DNA ligase at 0°C (for ligation of fragments with complementary single-stranded ends) or 1 mM ATP and 0.3- 
0.6 units T4 DNA ligase at U^'C (for "blunt end" ligation). Intermolecular ligations of fragments witfi complementary 
ends are usually performed at 33-100 \iQ/m\ total DNA concentrations (5-100 nM total ends concentratfon). Intermo- 
lecular blunt end ligations (usually employing a 20-30 fold molar excess of linkers, optionally) are performed at 1 ^iM 
total ends concerrtration. 

In vector consf uction, the vector fragment is commonly treated with bacterial or calf intestinal alkaline phosphatase 
(BAP or CI AP) to remove the 5' phosphate and prevent religation and reconstruction of the vector. BAP and CIAP diges- 
tion conditions are well known in the art, and published protocols usually accompany the commercially available BAP 
and CIAP enzymes. To recover the nucleic acid fragments, the preparation is extracted with phenol-chloroform and eth- 
anol precipitated to remove the phosphatase and purify the DNA. Alternatively, religation of unwanted vector fragments 
can be prevented by restriction enzyme digestion before or after ligation, if appropriate restrictfon sites are available. 

In the construction set forth below, correct ligations for plasmid construction are confirmed by first transforming a 
suitable host, such as E coli strain DG101 (ATCC 47043) or E. coli strain DQ1 16 (ATGC 53606). with the ligation mix- 
ture. Successful ti-ansformants are selected by ampicillin, tetracycline or otiier antibiotic resistance or sensitivity or by 
using other markers, depending on the mode of plasmid construction, as is understood in tiie art. Plasmids from the 
transformants are then prepared according to the metiiod of Clewell et ai, 1 969, Proc. Nat. Acad. Sci. USA 62:1 159, 
optionally folfowing chloramphenicol amplification (Clewell, 1972, J. Bacteriol . im:667). Alternatively, plasmid DNA cari 
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be prepared using the "Base-Acid" extraction method at page 11 of the Bethesda Research Laboratories publication 
Eocua 5(2). and very pure plasmid DNA can be obtained by replacing steps 12 through 17 of the protocol with 
CsCI/ethidium bromide ultracentrifugation of the DNA. The Isolated DNA is analyzed by restriction enzyme digestion 
and/or sequenced by the dideoxy method of Sanger et al., 1977, Proc . Natl . Acad . Sci. USA 74:5463 as further 
described by Messing et al., 1981 , Nuc- Acids Res. 9:309, or by the method of Maxam et a/., 1980, Methods jn Enzv- 
moloav 65:499. 

The control sequences, expression vectors, and transformation methods are dependent on the type of host cell 
used to express the gene. Generally, procaryotic. yeast, insect, or mammalian cells are used as hosts. Procaryotic 
hosts are in general the most efficient and convenient for the production of recombinant proteins and are therefore pre- 
ferred for the expression of the protein. 

The procaryote most frequently used to express recombinant proteins is E. coli. However, microbial strains other 
than £ coli can also be used, such as bacilli, for example Bacillus subtilis, various species of Pseudomonas, and other 
bacterial strains, for recombinant expression of the protein. In such procaryotic systems, plasmid vectors that contain 
replication sites and control sequences derived from the host or a species compatible with the host are typically used. 

For expression of constructions under control of most bacterial promoters, E. coli K12 strain MM294, obtained from 
the E coll Genetic Stock Center under GCSC #61 35, can be used as the host. For expression vectors with the PlNrbs 
or P J7RBS control sequence, £ coli K12 strain MC1000 lambda lysogen, N7N53CI857 SusPao, ATCC 39531. may be 
used. £ coli DG1 16 , which was deposited with the ATCC (ATCC 53606) on April 7, 1987, and £ coll KB2, which was 
deposited with the ATCC (ATCC 53075) on March 29. 1985, are also useful host cells. For M13 phage recombinants. 
£ coli strains susceptible to phage Infection, such as £ coli K12 strain DG98 (ATCC 39768). are employed. The DG98 
strain was deposited with the ATCC on July 13, 1984. 

For example. £ coll is typically transformed using derivatives of pBR322, described by Bolivar et aL, 1977, Gene 
£:95. Plasmid pBR322 contains genes for ampicillin and tetracycline resistance. These drug resistance markers can be 
either retained or destroyed in constructing the desired vector and so help to detect the presence of a desired recom- 
binant. Commonly used procaryotic control sequences, i.e.. a promoter for transcription initiation, optionally with an 
operator, along with a ribosome binding site sequence, include the p-lactamase (penicillinase) and lactose (lac) pro- 
moter systems (Chang a/.. 1977. Nature 198:1056), the tryptophan (trp) promoter system (Goeddel et al„ 1980. 
Nuc. Acids E^. S:4057). and the lambda-derived ?i promoter (Shimatake et a}., 1981 , Nature 22^:128) and gene N 
ribosome binding site (Nrbs). A portable control system cassette is set forth in U.S. Patent No. 4,711,845, issued 
December 8. 1987. This cassette comprises a Pl promoter operably linked to the Nrbs >" turn positioned upstream of 
a third DNA sequence having at least one restriction site that permits cleavage within six base pairs 3' of the Nrbs 
sequence. Also useful is the phosphatase A (phoA) system described by Chang ef a/., in European Patent Publication 
No. 196,864, published October 8, 1986. However, any available promoter system conpatible with procaryotes can be 
used to construct a expression vector of the invention. 

In addition to bacteria, eucaryotic microbes, such as yeast, can also be used as reconijinant host cells. Laboratory 
strains of Saccharomyces cerevisiae. Baker's yeast, are most often used, although a number of other strains are com- 
monly available. While vectors employing the two micron origin of replication are common (Broach, 1983. Meth . Enz . 
101:307), other plasmid vectors suitable for yeast expression are known (see, for exarr^le, Stinchcomb et al., 1979, 
Nature282:39; Tschempe a/., 1980, Gene 10:157; and Clarke etal., 1983, Meth- Eqz. 101:300). Control sequences 
for yeast vectors include promoters for the synthesis of glycolytic enzymes (Hess ef a/. 1968 J. Ad^ Enzyme Beg. 
2:149; Holland ef a/., 1978. PigtgghnPlogy 12:4900; and Holland ef a/., 1981. J. Biol. Chem . 256:1385). Additional pro- 
moters known in the art include the promoter for 3-phosphoglycerat6 kinase (Hitzeman et a!.. 1980 I iiol. Chem . 
255:2073) and those for other glycolytic enzymes, such as glyceraldehyde 3-phosphate dehydrogenase, hexokinase, 
pyruvate decarboxylase, phosphofructokinase, glucose-6-phosphate isomerase, 3-phosphoglycerate mutase, pyruvate 
kinase, triosephosphate isomerase, phosphoglucose isomerase, and glucoWnase. Other promoters that have the addi- 
tional advantage of transcr^tion controlled by growth conditions are the promoter regions for alcohol dehydrogenase 2. 
isocytochrome C. acid phosphatase, degradative enzymes associated with nitrogen metabolism, and enzymes respon- 
sit^e for maltose and galactose utilization (Holland, sypra) . 

Terminator sequences may also be used to enhance expression when placed at the 3" end of the coding sequence. 
Such terminators are found in the 3' untranslated region following the coding sequences in yeast-derived genes. Any 
vector containing a yeast-compatible promoter, origin of replication, and other control sequences is suitable for use in 
constructing yeast expression vectors. 

The coding sequence can also be expressed in eucaryotic host ceil cultures derived from multicellular organisms. 
See, for example. Tissue Culture. Academic Press. Cruz and Patterson, editors (1973). Useful host cell lines include 
COS-7. C0S-A2, CV-1. murine cells such as murine myelomas N51 and VERO. HeLa cells, and Chinese hamster 
ovary (CHO) cells. Expression vectors for such cells wdinarily include promoters and conti-ol sequences compatible 
with mammalian cells such as, for example, the commonly used early and late promoters from Simian Virus 40 (SV 40) 
(Rers e/a/., 1 978, Nature 273:1 13), or other viral promoters such as those derived from polyoma, adenovirus 2. bovine 
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papilloma virus (BPV), or avian sarcoma viruses, or immunoglobulin promoters and heat sliock promoters. A system for 
expressing DNA in mammalian systems using a BPV vector system is disclosed in United States Patent No. 4,419,446. 
A modification of this system is described in U.S. Patent No. 4.601 .978. General aspects of mammalian cell host system 
transformations have been described by Axel, U.S. Patent No. 4.399,216. "Enhancer" regions are also inportant in opti- 
mizing expression; these are, generally, sequences found upstream of the promoter region. Origins of replication may 
be obtained, if needed, from viral sources. However, integration into the chromosome is a common mechanism for DNA 
replication in eucaryotes. 

Plant cells can also be used as hosts, and control sequences compatible with plant cells, such as the nopaline syn- 
thase promoter and polyadenylation signal sequences (Depicker ef a/., 1982, J. Md. ApdI . Gen . 1:561) are available. 
Expression systems employing insect cells utilizing the control systems provided by baculovirus vectors have also been 
described (Miller eta!., in Genetic Engineering (1986). Setlow etaJ., eds., Plenum Publishing, Vol. 8. pp. 277-297). 
Insect cell-based expression can be accomplished in Spodoptera frugipeida. These systems are also successful in pro- 
ducing recombinant enzymes. 

Depending on the host cell used, transformation is done using standard techniques appropriate to such cells. The 
calcium treatment employing calcium chloride, as described by Cohen, 1972. Proc . Natl . Acad . Sd. USA 69:2110 is 
used for procaryotes or other cells that contain substantial cell wall ban-iers. Infection with Agrobacterium tumefadens 
(Shaw et af., 1 983. Gene 23:31 5) is used for certain plant cells. For mammalian cells, the calcium phosphate precipita- 
tion method of Graham and van der Eb. 1978, \tol2fl^5£:546 is preferred. Transformations into yeast are carried out 
according to the method of Van Solingen et al., 1977. J. iad- 110:946, and Hsiao etal., 1979. Proc . Natl . Acad . SCL- 
USA 76:3829. 

It may be desirable to modify the sequence of the DNA encoding the enzyme of the invention to provide, for exam- 
ple, a sequence more compatible with the codon usage of the host cell without modifying the amino acid sequence of 
the encoded protein. Such modifications to the initial 5-6 codons may improve expression efficiency. DNA sequences 
which have been modified to improve expression efficiency, but which encode the same amino acid sequence, are con- 
sidered to be equivalent and encompassed by the present invention, 

A variety of site-specific primer-directed mutagenesis methods are available and well-known in the art (see, for 
example, Sambrook etai, Molecular Cloning: A Laboratory Manual, Cold Spring Harbor, 1989. second edition, chapter 
15.51, "Oligonucleotide-mediated mutagenesis." which is incorporated herein by reference). The polymerase chain 
reaction (PGR) can be used to perform site-specific mutagenesis. In another technique now standard in the art, a syn- 
thetic oligonucleotide encoding the desired mutation is used as a primer to direct synthesis of a complementary nucleic 
acid sequence contained in a single-stranded vector, such as pBSM13+ derivatives, that serves as a template for con- 
struction of the extension product of the mutagenizing primer. The mutagenized DNA is transformed into a host bacte- 
rium, and cultures of the transformed bacteria are plated and identified. The identification of modified vectors may 
involve transfer of the DNA of selected transfbrmants to a nitrocellulose filter or other membrane and the "lifts" hybrid- 
ized with Wnased synthetic mutagenic primer at a tenperature that permits hybridization of an exact match to the mod- 
ified sequence but prevents hybridization with the original unmutagenized strand. Transformants that contain DNA that 
hybridizes with the probe are then cultured (tiie sequence of the DNA is generally confirmed by sequence analysis) and 
serve as a reservoir of the modified DNA. 

Once the protein has been expressed in a recombinant host cell, purification of the protein may be desired. A vari- 
ety of purification procedures can be used to purify the recombinant thermostable DNA polymerase of the invention. 
Examples include the methods for purifying Taq DNA polymerase described in U.S. Patent No. 4,889,818; 5,352,600; 
and 5,079,352; the methods for purifying the DNA polymerase from Thermus thermophilis ( Tth) described in U.S. Pat- 
ent Nos. 5,618,711 and 5,310,652; the methods for purifying Tma DNA polymerase described in U.S. Patent Nos. 
5,374,553 and 5,420,029. Methods for purifying these DNA polymerases are also described in U.S. Patent No. 
5,466.591 . All of tile above patents are incorporatKi herein by reference. 

In a preferred method, the expression of the DNA polymerase is carried out in E. cofi, which is a mesophilic bacte- 
rial host cell. Because E. coli host proteins are heat-sensitive, the recombinant thermostable DNA polymerase can be 
substantially enriched by heat inactivating the crude lysate. This step is done in ttie presence of a sufficient amount of 
salt (typically 0.2-0.4 M ammonium sulfate) to reduce ionic interactions of the DNA polymerase witii other cell lysate 
proteins. 

Activity of the purified DNA polymerase is assayed as described in Lawyer ef a!., 1989, J. Bio!. Chem . 264:6427. 
incorporated herein by reference. 

For long-term stability, the purified DNA polymerase enzyme must be stored in a buffer tiiat contains one or more 
non-ionic polymeric detergents. Such detergents are generally ttiose that have a molecular weight in the range of 
approximately 100 to 250,00 preferably about 4,000 to 200.000 daltons and stabilize the enzyme at a pH of from about 
3.5 to about 9.5. preferably from about 4 to 8.5. Examples of such detergents include those specified on pages 295-298 
of McCutcheon's Emulstfiers & Detergents. North American edition (1983). published by the McCutcheon Division of 
MC Publishing Co., 175 Rock Road, Glen Rock. NJ (USA), the entire disclosure of which is incorporated herein by ref- 
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erence. Preferably, the detergents are selected from the group comprising ethoxylated fatty alcohol ethers and lauryl 
ethers, ethoxylated alkyi phenols, octylphenoxy polyethoxy ethanoi compounds, modified oxyelhylated and/or oxypro- 
pylated straight-chain alcohols, polyethylene glycol nrronooleate compounds, polysorbate compounds, and phenolic 
fatty alcohol ethers. More particularly preferred are Tween 20^^, a polyoxyethyiated (20) soititan monolaurate from ICI 
Americas Inc. (Wilmington, DE). and Iconol™ NP-40, an ethoxylated alkyI phenol (nonyl) from BASF Wyandotte Corp. 
(Parsippany, NJ). 

The thermostable enzyme of this invention may be used for any purpose in which a thermostable DNA polymerase 
is necessary or desired. In a preferred embodiment, the enzyme is for DNA sequencing (see Innis et al., 1988, Proc. 
NaSL Acad. ScL USA 85:9436-9440, incoiporated herein by reference). 

The following examples are offered by way of illustration only and are by no means intended to limit the scope of 
the claimed invention. In these examples, all percentages are by weight if for solids and by volume if for liquids, unless 
othenvlse noted. 

Example 1 

Construction of an Exr^r ession System 

An expression system is constructed from the deposited plasmid, pUC18:Tma25, which contains the gene having 
nucleotide sequence SEQ ID NO: 9. using conventional techniques well known in the art. The steps involved, which are 
described in more detail below, are as follows. 

- The DNA polymerase coding sequence contained in pUC18:Tma25 is subcloned into a pDG160 expression vector, 
resulting in plasmid pTMA25. 

- The D323A and E325A mutations are added to pTMA25 by site-specific primer-directed mutagenesis, resulting in 
plasmid pTMASO. 

- The mutated gene coding sequence from pTMA30 is then subcloned into a pDG184 expression vector such that 
codons 1-283 are deleted, resulting in plasmid pTMA31. 

- The F730Y mutation is added to pTMA31 by site-specific primer-directed mutagenesis, resulting in plasmid 
pTMA31[F730Y]. 

- A fragment of the mutated coding sequence from pTIVIA31[F730Y] containing the F730Y mutation is subcloned into 
pTMA30 to replace the corresponding unmutated fragment, resulting in plasmid pTMA30IF730Y]. 

Following each mutagenesis or sub-cloning step, £ coli strain DG1 1 6 host cells are transformed with the plasmid 
constructs. Ampicillin resistant (plasmid containing) colonies are screened for the presence of the desired plasmid 
using standard methods. Typically, first colonies are selected for the presence of a plasmid of the expected size by gel 
electrophoretic size fractionation. Candidate colonies are further screened for pfasmids exhibiting the expected frag- 
ment pattern following digestion with one or more restriction enzymes. Rnally, mutagenized sites and ligation junctions 
are sequenced to confirm the intended sequence. 

Plasmid pDGI 60 is described in U.S. Patent No. 5,61 8,71 1 . incorporated herein by reference. Plasmid pDG1 60 is 
a cloning and expression vector that comprises the bacteriophage k Pi promoter and gene N ribosome binding site 
(see U.S. Patent No. 4,711,845, incorporated herein by reference), a restriction site polylinker positioned so that 
sequences cloned into the polylinker can be expressed under the control of the A. Pl promoter and gene N ribosome 
binding site, and a transcription terminator from the Bacillus thuhngiensis delta-toxin gene (see U.S. Patent No. 
4,666,848. incorporated herein by reference). Plasmid pDG160 also carries a mutated RNA II gene, which renders the 
plasmid temperature sensitive for copy number (see U.S. Patent No. 4,631.257, incorporated herein by reference). 

These elements act in concert to make plasmid pDGl60 a very useful and powerful expression vector. At 30-32*»C, 
the copy number of the plasmid is low, and in an host cell that carries a temperature-sensitive repressor gene, such as 
CI857, the Pl promoter does not function. At 37-41 °C, however, the copy number of the plasmid is 50-fokl higher than 
at 30-32°C, and the cl857 repressor is inactivated, allowing the Pl promoter to function. Plasmid pDG160 also carries 
an ampicillin resistance (AmpR) marker. In summary, plasmid pDGl60 comprises the AmpR marker, the Pl promoter 
and gene N ribosome binding site, a polylinker, and the BT cry PRE (BT positive retroregulatory element, U.S. Patent 
No. 4,666,848) in a ColEI cop^ vector. 

Plasmid pDG184 is described in U.S. Patent No. 5,420,029, incorporated herein by reference. Plasmid pDG184 is 
a derivative of pDQ160, modified to include an Nco I site at the start codon of the inserted gene. The rest of the plasmid 
is functionally unchanged from pDG160. 
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I. Sub-doning I 

The DNA polymerase coding sequence is subcloned from plasmid pUC18:Tma25 into a pDG160 expression plas- 
mid as follows: 

5 

A. Plasmid pUC18:Tma25, a 5347 base pair (bp) plasmid. is linearized by digestion with Nsp V, which cuts once at 
position 2084 (numbered starting with the first nucleotide of the coding sequence). 

B. Tlie linearized plasmid resulting from the Nsp V digestion is digested further with Bam HI, which cuts at nucle- 
otide (nt) positions 1661, 1989, 2039, and 2686. A 602 bp Nsp V/Bam HI fragment (nt 2085-2686) containing the 

10 3' end of the DNA polymerase gene is gel purified. 

C. In a separate reaction, linearized plasmid resulting from the Nsp V digestion is digested further with Hind III, 
which cuts at positions 2629 and 5342. A 2089 bp Nsp yJHind III fragment (nt 5343-2084) containing the 5' end of 
the DNA polymerase gene is gel purified. 

D. Plasmid pDG160 is digested with Hindi III and Bam HI and treated with caff intestinal alkaline phosphatase 
is (CIAP) to remove the 5' phosphate and prevent religation and reconstruction of the vector. Alternatively, the 

digested vector fragment is gel purified. 

E. The isolated fragments from steps B and C are combined with the digested pDGl60 plasmid from step D in a 
2:2:1 ratio at a concentration of 10-40 ng/jil of total DNA and ligated, resulting in a 8218 bp plasmid. 

D. The ligation product is transformed into £ coli DQ1 16 cells (described above) and transformant colonies which 
20 contain the desired plasmid, designated pTMA25, are identified by screening. 

II. Mutaoenesis I: D323A and E325A 

Mutations in the DNA polymerase coding sequence of pTMA25 which result in the D323A and E325A amino acid 
25 mutations are made using site-specific primer-directed mutagenesis. For convenience in later manipulations, additional 
mutations are made which eliminate a Bgl II restriction enzyme cleavage site and create an Spe I restriction enzyme 
cleavage site. These additional mutations are made such that the encoded amino acid sequence is unchanged. 
The following primers are used in the mutagenesis: 

30 ' Primer PI : mutagenic upstream primer corresponding to nucleotides 958-988 of SEQ ID NO: 9. with mutations as 
described in the table below. 

Primer P2: mutagenic downstream primer consisting of the reverse complement of primer Pi . 
- Primer P3: upstream primer corresponding to nucleotides 608-627 of SEQ ID NO: 9, which encorrpasses an Xba 
I site (nucleotides 621-626). 

35 - Primer P4: downstream primer corresponding to nucleotides 1319-1339 of SEQ ID NO: 9, which encompasses 
part of a Sac I site (nucleotides 1318-1323). 

The sequence of mutagenic upstream primer Pi consists of nucleotides 958-988 of the coding strand of SEQ ID 
NO: 9, except for the changes shown in the table below. The change in codon 323 (nucleotides 967-969) resulted in the 
40 elimination of a Bgl II site. The changes in codons 326 (nucleotides 966-978) and 327 (nucleotides 979-981) do not 
affect the sequence of the encoded amino acid, but results in the creation of a Spe I site. 



Mutations In the primer PI 


nucleotides 


codon 


nucleotide change 


amino acid change 


967-969 


323 


GAT -> GOT 


D323A 


973-975 


325 


GAG-> GOG 


E325A 


976-978 


326 


ACG -> ACT 


none 


979-981 


327 


TCT ->AGT 


none 



The mutagenesis is carried out as described below. All anplifications are carried out by PGR under conditions well 
known in the art. For example, amplifications may be carried out using the GeneAmp PGR Reagent Kit with AnpliTaq® 
DNA Polymerase (PerWn Elmer, Nonvalk, CI). 
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A. A region of the coding sequence is amplified from purified pTMA25 using primers P3 and P2, and the resulting 

381 bp amplified product is gel purified. 

B. A region of the coding sequence is amplified from purified pTMA25 using primers PI and P4, and the resulting 

382 bp amplified product is gel purified. 

5 C. The amplified products from steps A and B are combined, heat denatured at 95*C, annealed, and extended with 
DNA polymerase using standard techniques. 

D. The annealed and extended duplex DNA from step C is re-amplified using primers P3 and P4, and the resulting 
732 bp amplified product is gel purified. 

E. The amplified DNA from step D is digested with Xba I and Sac I 

10 F. Plasmid pTMA25 is digested with Xba I and Sac \, and treated with calf intestinal alkaline phosphatase (CIAP) 
to remove the 5' phosphate and prevent religation and reconstruction of the vector. 

G. The digested DNA from step E Is combined with the digested plasmid from step F in a 3:1 ratio and llgated. 

H. The ligation product is transformed into £ coli DQ1 16 cells and transformant colonies which contain the desired 
plasmid. designated pTMASO. are identified by screening. 

15 

III. Sub-cloninp II 

The mutated gene coding sequence from pTMA30 is then subcloned Into a pDG184 expression vector such that 
codons 1-283 are deleted. Nucleotide position numbers used herein refer to the position within the plasmid, wherein 
20 position 1 is defined by the Eco Rl site upstream of the Pl promoter. The sub-cloning is carried out as follows: 

A. Plasmid pTMA30, a 8218 bp plasmid, is digested with Mlu I, which cuts at nucleotide position 4443; Bsp HI, 
which cuts at positions 1210, 4761 , 5769, and 5874; and Afl II, which cuts at position 7827. The Afl II digestion is 
carried out to further degrade a 3554 bp Bsp HUBsp HI fragment, which is similar in size to the desired 3233 bp 

26 Bsp HUMIu I fragment, in order to facilitate isolation of the desired fragment. The digestion yields six fragments, 
with lengths of 3233, 1952, 1601, 1008, 318, and 105 bp. The 3233 bp Bsp HUMIu I fragment corresponding to 
nucleotides 1211 -4443 of the plasmid is isolated by gel electrophresis. 

B. Plasmid pDG184, a 5474 bp plasmid, is digested with Mlu I, which cuts at position 1699, and Nco I. which cuts 
at position 284. The digested fragments are treated with calf intestinal alkaline phosphatase (CIAP) to remove the 

30 5* phosphate and prevent religation and reconstruction of the vector. Alternatively, the 4059 bp fragment is isolated 
by gel electrophoresis. 

C. The isolated fragment from step A is combined with the digested pDG184 plasmid from step B in a 1 :1 ratio at 
a concentration of 10-40 ng/^l of total DNA and ligated, resulting in a 7292 bp plasmid. 

D. The ligation product is transformed into £ cofi DG1 16 cells and transformant colonies which contain the desired 
35 plasmid, designated pTMA31 , are Identified by screening. 

IV. Mutaoenesis II: F73QY 

Additional mutations In the DNA polymerase coding sequence of pTMASI which resulted in the F730Y mutation in 
40 the encoded amino acid sequence mutations were made using site-specific primer-dtrected mutagenesis. The muta- 
genesis was carried out using methods analogous to those described above. 
The following primers were used in the mutagenesis. 

- Primer FR1 : mutagenic upstream primer corresponding to nucleotides 2173-2202 of SEQ ID NO: 9, with mutations 
4S as described in the table below. 

- Primer FR2: mutagenic downstream primer essentially consisting of the reverse complement of primer FR1, but 
corresponding to nucleotides 21 72-2200 of SEQ ID NO: 9. 

- Primer FR3: upstream primer corresponding to nucleotides 1952-1972 of SEQ ID NO: 9, which lies upstream of a 
esfXIslte. 

50 - Primer FR4: downstream primer corresponding to nucleotides 241 5-2433 of SEQ ID NO: 9, which lies downstream 
of a Xma I site. 

The sequence of mutagenic upstream primer FR1 consists of nucleotides 2173-2202 of the coding strand of SEQ 
ID NO: 9, except for the changes shown in the table below. The change in codons 729(2185-2187) does not affect the 
55 sequence of the encoded amino acid, but results in the creation of a Hpa I site. 
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Mutations in the primer FR1 


nucleotides 


codon 


nucleotkle change 


amino acid change 


2185-2187 


729 


AAT->AAC 


none 


2188-2190 


730 


TTT->TAT 


F730Y 



The mutagenesis was can'ied out as described below. 

A. A region of the coding sequence was anrplified from purified p^ASI using primers FR3 and FR2, and the 
resufting 249 bp anriplrfled product was gel purified. 

B. A region of the coding sequence was amplified from purified pTMA31 using primers FR1 and FR4, and the 
resulting 261 bp amplified product was gel purified. 

C. The amplified products from steps A and B were combined, heat denatured at SS'^C, annealed, and extended 
with DNA polymerase using standard techniques. 

D. The annealed and extended duplex DNA from step C was re-amplified using primer FR3 and FR4, and the 
resulting 482 bp amplified product was extracted using a phenoi/chloroform mixture and precipitated with EtOH. 

E. The amplified DNA from step D was digested with Bst XI and Xma \, and the desired 337 bp DNA fragment was 
separated from smaller fragments using a CENTRICON 100 column (Amicon, Beverly. MA). 

F Plasmid pTMA31 was digested with Bst XI and Xba I. 

G. The digested DNA from step E was combined with the digested plasmid from step Fin a 3:1 ratio and ligated. 

H. The ligation product was transformed into E coli DQ116 cells. Colonies were screened for the presence of the 
desired mutated plasmid by amplifying the plasmid DNA using primers FR3 and FR4, which amplify a region 
encompassing the unique Hpa I site introduced during the mutagenesis, digesting the anplified product with Hpa 
\, and analyzing the digestion product by gel electrophoresis. A colony containing the desired plasmid, designated 
pTMA31[F730Y], was selected and the gene sequence was confirmed by DNA sequencing. 

The resulting expression system expresses a DNA polymerase, designated F730Y7ma31 DNA Polymerase, that 
consists of amino acids 284-893 of Tma DNA polymerase, mutated with the D323A, E325A, and F730Y mutations. 

V Sub-cloning III 

A fragment of the mutated coding sequence from pTMA31[F730Y] containing the F730Y mutation was subcloned 
into pTMA30 to replace the corresponding unmutated fragment, resulting in plasmid pTMA30IF730Y]. Nucleotide posi- 
tion numbers used herein refer to the position within the plasmid, wherein position 1 is defined by the Eco Rl site 
upstream of the X Pl promoter. The sub-cloning was carried out as follows. 

A. Plasmid pTMA31[F730YI, a 7292 bp plasmid, was digested with Mfu \, which cuts at nucleotide position 3517, 
and Spe I. which cuts at position 41 2. The 31 05 bp Mlu MSpe I fragment corresponding to nucleotides 41 3 to 351 7 
of the plasmid was isolated by gel electrophresis. 

B. Plasmid pTMA30, a 8218 bp plasmid, is digested with Mlu I, which cuts at nucleotide position 4443, and Spe I, 
which cuts at position 1338. The 511 3 bp Mlu USpe I fragment corresponding to nucleotides 4444-1 338 of the plas- 
mid fragment was isolated by gel electrophoresis. 

C. The isolated fragment from step A is combined with the isolated fragment from step B in a 1 :1 ratio at a concen- 
tration of 10-40 ng/^l of total DNA and ligated. 

D. The ligation product was transformed into £ coli DG1 16 cells. Colonies were screened for the presence of the 
desired 8.2 kb plasmid by amplifying the plasmid DNA using primers which amplify regions encompassing the 
unique Hpa I and Spe I sites introduced during the mutatageneses, digesting the amplified products with Hpa I or 
Spe I, and analyzing the digestion products by gel electrophoresis. Plasmid DNA was prepared from colonies that 
contained plasmids which exhibited the expected digestion pattern in the screen, and was further analyze! by 
digestion with Hpa I, Spe I. and Mlu I followed by gel analysis of the digested DNA. A colony containing the desired 
plasmid, designated pTMA30[F730Y], was selected and the gene sequence was confirmed by DNA sequencing. 

The resulting expression plasmid, pTMA30[F730Y], is under the control of the bacteriophage X P|_ promoter and 
gene N ribosome binding site, and a Positive Retroregulatory Element (PRE, transcription terminator) from the Bacillus 
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thuringiensis delta-toxin gene. The plasmid also carri^ a mutated RNA II gene which renders the plasmid temperature 
sensitive for copy number and an ampicillin resistance gene. 

Example 2 

Expression of the recom binant DNA polvmerase 

This example describes the expression and purification of F730Yrma30 DNA Polymerase using an expression 
system, £ cofi K12 strain DG116 cells harboring plasmid pTMA30[F730Y], essentially as described in example 1. 

Initial growth of the expression system cells was can-led out in a seed flask. Large scale fermentation was carried 
out in a 10 liter fermentation flask Inoculated with the seed culture. The media and protocols used were as follows. 

The seed medium consisted of 1X Bonner-Vogel salts (9.6 mM citric acid, 57 mM K2HPO4, 16.8 mM NaNH4HP04, 
0.8 mM MgS04), + 25 mM (NH4)2S04, 2 mM MgS04. 10 \ig/m\ thiamine-HCt, 0.2% glucose. 0.25% casamino adds, 
and 100 ng/ml ampicillin and methicillin. The medium was formulated from sterile stock solutions, then filter-sterilized 
prior to use. 

The fermentation medium consisted of IX Bonner-Vogel salts (9.6 mM citric acid. 57 mM K2HPO4, 16.8 mM 
NaNH4HP04, 0.8 mM MgS04), + 25 mM (NH4)2S04, 2 mM MgS04. 10 MnS04, 6.9 ^M ZnClg, 8.4 ^M CoClg, 8.3 
nM NaMo04. 6.8 mM CaCl2, 7.4 ^M CuCIa, 8.1 ^lM H3BO3, 1 \M FeCIa, 0.5 ml/l Macoll P2000 antitbam, 10 ^ig/ml thia- 
mine* HCI, 1.6% glucose. 2.0% casamino adds, and 100 ng/ml ampicillin. The above ingredients (through the anti- 
foam) were sterilized in situ at 12rc for 20 minutes, and the rest added from sterile, stock solutions, just prior to 
inoculation. 

The seed culture was grown in a 100 ml flask of seed medium inoculated with 0.1 ml of frozen expression system 
cells. Following inoculation, the culture was shaken overnight at 30°C. The entire flask culture was used to inoculate a 
10 liter fermentor culture. 

Fermentation was carried out as follows. The initial temperature was 30^C. the pH was controlled at 6.9+/-0.1 with 
4N NH4OH and glacial acetic acid, and the dissolved oxygen controlled at 30% by adjusting the agitation rale as 
needed from an initial, minimum value of 300 rpm. The aeration rate was held constant at 5 liters per minute. When the 
culture reached 2.5 OD (680 nm), after about 6-7.5 hours, the temperature was shifted to 38.5**C to induce synthesis of 
the DNA polymerase using a ramp rate of 0.40**C/mlnute. The fermentation was allowed to continue overnight, to a total 
run time of about 24 hours. Cell paste was harvested by cross-flow filiation and centrifugation, and frozen at -20°C. 

Example 3 

Purtfication of the recom binant DNA polymerase 

This example describes the purification of the expressed F730Yrma30 DNA Polymerase from the fermentation 
described above. The purification was carried out essentially as described in Lawyer et a!., 1993. PGR Method and 
Applications 2:275-287, with modifications as described below. 

The following standard abbreviations are used. 

PEI o polyethylenlmine 

TLCK = N-a-p-tosyl-L-lysine chloromethyl ketone-HCI 

PEI is available commercially from, among others, Polysciences, Inc. (Warrington, PA). 
TLCK is available commercially from, among others, Sigma Chemical Co. (St. Louis, MO). 

Approximately 1 50 grams of frozen (-70*'C) cells from the fermentation were thawed in lysis buffer (50 mM Tris-HCI, 
pH 7.5) containing 10 mM EDTA, 1 mM dithiothreitol (DTT), 2 mM Pefabloc SC (CenterChem, Inc., Stamfond. CT); 1 
tig/ml Leupeptin (Boehringer Mannheim, Indianapolis, IN), and 1 mM TLCK. The cells were lysed by passage five times 
through a Microfluidizer at 1 0,000 psi. The lysate was diluted with lysis buffer to a final volume of 5.5X cell wet weight. 
The resulting lysate was designated Fraction I. 

Ammonium sulfate was gradually added to the Fraction I lysate to a concentration of 0.2 M. Fraction I then was PEI- 
precipitated as follows. 

PEI titrations were used to determine the minimum amount of PEI necessary to precipitate nucleic adds. Ten ^1 of 
each trial predpitation were added to 100 nl of 0.5 jig/ml Ethidium Bromide in a standartl microwell plate. Standards 
consisted of af^ropriately diluted lysate containing no PEI. The plate was illuminated with UV light, and the concentra- 
tion of PEI needed to remove at least 99% of the nucleic acid was determined. 

PEI was added slowly with stirring to 0.4% (concentration as determined from the titi-ations). The PEI treated lysate 
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was centrifuged in a JA-10 rotor (500 ml bottles) at 8,000 RPM (1 1,300 x g) for 30 minutes at 5*0. The supernatant 
(Fraction II) was decanted and retained. 

Amnnonium sulfate was added to the Fraction II supernatant to a concentration o1 0.4 M. Fraction II then was heat- 
treated as follows. 

5 The heat treatment was carried out in a 3 liter Braun fermentor The agitation rate was 250 rpm. The temperature 
was increased to 75** C over 6 minutes, held for 15 minutes, then cooled in the fermerrtor to 30* C as rapidly as possible. 
The heat-treated Fraction II supernatant from the PEI precipitation was removed from the fermentor and held on ice for 
at least 30 minutes, then centrifuged as described above. The supernatant (Fraction III) was decanted and retained. 
Fraction III was subjected to phenyl sepharose column chromatography as follows. A 250 ml radial flow column 

10 (Sepragen Corp., Hayward, CA) was packed with Phenyl Sepharose Fast Flow (High Sub) (Pharmacia, Piscataway, 
NJ). Fraction III was diluted with 50 mM Tris (pH 7.5), 10 mM EDTA to reduce the ammonium sulfate to 0.3 M and then 
applied to the column. The column was washed (flow rate of 50 ml/minute) for 15-20 minutes (3-4 column volumes) in 
each of the following 4 buffers: (1) 50 mM Tris, pH 7.5. 10 mM EDTA, 0.3 M ammonium sulfate, 1 mM DTT; (2) 26 mM 
Tris, pH 7.5, 1 mM EDTA, 1 mM DTT; (3) 25 mM Tris. pH 7.5, 1 mM EDTA, 20% vA^ ethylene glycol. 1 mM DTT; and (4) 

IS 25 mM Tris, pH 7.5, 1 mM EDTA, 20% vA^ ethylene glycol, 1 mM DTT, 2.0 M urea. The urea eluate containing the DNA 
polymerase (Fraction IV) was collected as a single pool from approximately 3 to 18 minutes of the urea elution. The 
entire phenyl sepharose column step was completed in under 2 hours. 

Fraction IV was subjected to heparin sepharose column chromatography as follows. Fraction IV (about 750 ml) was 
made 0.05 M in KCI (from a 3 M stock) and then loaded onto a 100 ml radial flow heparin sepharose column, which had 

20 been equilibrated in 25 mM Tris, pH 7.5, 1 mM EDTA, 0.05 M KCI, 1 mM DTT After the load, the column was washed 
(flow rate of 20 ml/minute) for 30 minutes in equilibration buffer, then in 25 mM Tris, pH 7.5, 1 mM EDTA, 0.10 M KCI, 1 
mM DTT Finally the DNA polymerase was eluted in a 12 column volume gradient in 25 mM Tris, pH 7.5, 1 mM EDTA, 
and 0.10 to 0.5 M KCI, 1 mM DTT, collecting 75 fractions of 16 ml each. The heparin sepharose column step was com- 
pleted in less than 3 hours. Fractions were analyzed by SDS-PAGE and some early fractions containing DNA polymer- 

25 ase that are less pure were removed from the pool (Fraction V). 

Fraction V was concentrated to 20 ml on an Amicon YM30 membrane (Amicon Inc., Beverly, MA). The concentrate 
was dialyzed overnight at 4<*C against 3X storage buffer (60 mM Tris, pH 8.0, 0.3 mM EDTA, 0.3 mM KCI, 3 mM DTT). 
Glycerol was added to the dialysate to a final concentration of 50 % (v/v) from an 80% (v/v) stock. Tween 20™ was 
added was added to a final concentration of 0.2% (w/v) from a 10% (w/v) stock. Sterile water was added to bring the 

30 volume of the preparation to 3 times the volume of the original lysate, yielding Fraction VI, a storage-stable preparation 
of F730Yrma30 DNA Polymerase. 

Fraction VI was assayed for DNA polymerase activity essentially as described in Lawyer et al., 1989, J. Biol. Chem. 
264:6427, incorporated herein by reference. 

55 Example 4 

Extension Rate 

The extension rate off the F730Yrma30 DNA Polymerase was measured using a template-limited primer extension 

40 assay The assay was carried out using an excess of DNA polymerase, under which conditions the rate of extension is 
independent of the DNA polymerase concentration. 

The chimeric enzyme of the present invention, F730Y7/na30 DNA Polymerase, was compared to F730Yrma31 
DNA Polymerase, expressed from plasmid pTMA31IF730Y], described above. F730YTma31 DNA Polymerase is a 
mutated version of UlTma™ DNA Polymerase (PerWn Elmer, Nonwalk, CT) that incorporates the D323A and E325A 

45 mutations which inactivate the 3' to 5* exonuclease activity, and the F730Y mutation. F730Y7/r7a30 DNA Polymerase 
and F731Yrma31 DNA Polymerase differ primarily in that F730YTma30 DNA Polymerase contains the 5'-nuclease 
domain from 7aQ DNA polymerase which has been mutated to inactivate the 5-nuclease activity, whereas 
F730Yr/r7a31 DNA Polymerase is missing the first 283 amino acids of Tma DNA polymerase. Accoidingly, 
F730Yrma31 DNA Polymerase lacks 5-nuclease activity as a result of a deletion off most of the 5'-nuclease domain. 

50 DNA polymerase preparations first were assayed as described in Lawyer et af., 1989, J. Bid. Chem . 264:6427, to 
determine the unit concentration and to determine an amount of enzyme needed such that the enzyme would be in 
excess. Based on these assays, it was determined that the use of 1 unit of F730Y7"ma30 DNA Polymerase or 3.5 units 
of F730Yrjma31 DNA Polymerase in the extension rate assay described below was sufficient to insure that the exten- 
sion rate would be independent of enzyme concentration. The definition of a unit of enzyme is as defined in Lawyer et 

55 a/., 1989, supra. 

Extension rate was assayed for 3 minutes at 75**C in a 50 ^1 reaction mixture containing 5 ^1 of DNA polymerase 
(diluted as described in Lawyer et aL, 1 989, supra, to contain the unit anwunt described above) and 45 fil of a reaction 
buffer containing 50 mM Bicine, pH 8.3, 25*'C; 2.5 mM MgClg: 1 mM p-mercaptoethanol; 200 |iM each of dATP, dGTP 
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and dTTP; 100 \jM [a-^^PJdCTP (0.8 ^Ci/reaction); and 0.075 pmdes of the M13rTp18 (Perkin Elmer, Norwalk, CT) 
template DNA preannealed to primer DG48. (SEQ ID NO: 11; S-GGGAAGGGCGATCGGTGCGGGCCTCTTCGC). 
The reactions were stopped by the addition of 10 fil 60 mM EDTA and stored at 0**C. 

A 25 ^1 portion of the stopped reaction was diluted with 1 ml of 2 mM EDTA with 50 ^g/ml sheared salmon sperm 

5 DNA as a can'ier. The DNA was precipitated by the addition of 1 ml 20% trichloroacetic acid (w/v) and 2% sodium pyro- 
phosphate, and incubated at O^'C for 15 minutes. Precipitated DNA was collected on GF/C filter discs (Whatman Inter- 
national Ltd.. Maidstone. England) and washed extensively with 5% trichloroacetic acid and 2% sodium pyrophosphate, 
then with 5% trichloroacetic acid, then with 5 ml of 95% ethanol, dried, and counted. 

The amount of [a-^^P]dCMP incorporated per minute was determined for each sample. The data shown below rep- 

10 resent the average of two reactions. 



DNA Polymerase 


CPM 


F730Y7*ma30 
F73QYTmaZ^ 


1575 
1116 


Ratio 


1.41 



20 

The data indicate that, as measured by the above assay. F730YrAna30 DNA Polymerase has a 41% greater exten- 
sion rate than F730Y7"/77a31 DNA Polymerase. In view of the difference between the two enzymes, the data indicate 
that the presence in F730Y7ma30 DNA Polymerase of the 5'-nuclease domain from Taq DNA polymerase, although 
inactivated by the G46D mutation, results in a significantly higher extension rate. 
25 The extension products from a series of time points were analyzed further by denaturing agarose gel electrophore- 
sis, which confirmed that the results represent an increase in the extension rate of the enzyme. 

Example 5 

30 Dve Terminator Cvcle Sequencinp 

This example demonstrates the application of the F730Y Tma30 DNA Polymerase to dye-labeled, dideoxy-termina- 
tor cycle sequencing. For comparison, cycle sequencing reactions also were carried out using AmpliTaq® DNA 
Polymerase, FS, a mutant form of Taq DNA polymerase that lacks exonuclease activity ard incorporates an F667Y 

35 mutation, which is analogous to the F730Y mutation in F730YTma30 DNA Polymerase. 

Cycle sequencing reactions were carried out using the reagents and protocols of the ABI PRISM™ Dye Terminator 
Cycle Sequencing Core Kit with AmpliTaq® DNA Polymerase FS (Perkin Elmer, Nonwalk, CT). The separate packaging 
of the reagents in this kit allowed for easy substitution of F730Y7ma DNA polymerase for AmpliTaq® DNA Polymerase 
FS- In the kit, the AmpliTaq® DNA Polymerase FS is provided combined with rTth Thermostable Inorganic Pyrophos- 

40 phatase. For reactions using F730Yrma30 DNA Polymerase, the DNA polymerase/jayrophosphatase mixture of the kit 
was replaced with 10 units of F730YTma30 DNA Polymerase and 20 units of rTth TTiermostable Inorganic Pyrophos- 
phatase. rTth Thermostable Inorganic Pyrophosphatase is described in copending U.S. Patent No. 5.665,551 . incorpo- 
rated herein by reference. 

The positive control template, pGEM®-3Zf(+) and primer, -21 Ml 3, supplied with the kit were used. Reactions were 
45 carried out in a GeneAmp® PGR System 9600 thermal cycler (PerWn-Elmer, NonwalK CT) using the recommended 
thermal cycling protocol (25 cydes: 9B°C for 10 seconds; 50°C for 5 second; and 60°C for 4 minutes). 

Extension products were purified of unincorporated dye terminators by spin column purification using a Centri- 
Sep™ column from Princeton Separations (Ade^^hia, NJ) and dried in a vacuum centrifuge, as recommended in the 
protocol. Samples were resuspended in 6 ^ of loading buffer (deionized formamide and 25 mM EDTA (pH 8.0) contain- 
so ing 50 mg/l Blue dextran in a ratio of 5:1 formamide to EDTA/Blue dextran). The samples were votexed, spun, heated 
to 90*»C for 3 minutes to denature, and then directly loaded onto a pre-electrophoresed 48 cm (well-to-read) 4% poly- 
acrylamide/6 M urea gel and electrophoresed and analyzed on an ABI PRISM™ 377 DNA Sequencer (Perkin Elmer, 
Nonwalk, CT) according to the manufacturer's instructions. 

The resulting sequencing traces are shown in the figures. Figures 2A, 2B, and 2C provide a sequencing trace from 
55 a cycle sequencing reaction using F730Y7/77a30 DNA Polymerase, and Figures 3A, 3B, and 3C provide sequencing 
trace from a cycle sequencing reaction using AmpliTaq® DNA Polymerase, FS. The base calling was set to begin with 
the tenth nucleotide from the primer. 

It is clear from a comparison of the sequence tracings that the use of F730Yrma30 DNA Polymerase results in a 
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significant improvement in tlie overall uniformity of peak lieights when compared to the results obtained using Ampli- 
Taq® DNA Polymerase FS. In particular, the use of F730Yrn7a30 DNA Polymerase significantly increases the peak 
heights of those bases which, because of the DNA sequence context, result in very low peak heights when AnpliTaq® 
DNA Polymerase FS is used, such as Q after A or C. A after A or C, and T after T. Similarly, the use of F730YTma30 
DNA Polymerase significantly decreases the peak height of those bases which, because of the DNA sequence context, 
result in very high peak heights when Am|:^iTaq® DNA Polymerase FS is used, such as A after G. The uniformity of 
peak heights contributes to an increase in the accuracy of the sequencing. 

The accuracy of the sequencing, i.e.. the fraction of bases correctly sequenced, averaged for two duplicated reac- 
tions, was calculated from the results of the automated base-calling by the ABI PRISM™ 377 DNA Sequencing System 
analysis software. TTie results are summarized in the table, betow. Typically, sequencing enters are most prevalent in 
the region next to the primer and the terminal regions away from the primer Consequently, the first 10 nucleotides fol- 
lowing the primer were ignored and the accuracy was calculated separately for the subsequent 50 nucleotides, the next 
500 nucleotides, and finally two terminal regions, each 100 nucleotides in length. 



Conparison of Sequencing Accuracy 




nucleotide position: 




11-60 


61-560 


561-660 


661-760 


F730YTma DNA Polymerase 
AmpliTaq® DNA Polymerase FS 


95% 
97% 


100% 
99% 


100% 
97% 


97.5% 
88.5% 



The results demonstrate that F730YT/T7a30 DNA Polymerase provides a substantial improvement in sequencing 
accuracy; strikingly so at longer read lengths (> 560 nucleotides). The use of F730YT/77a30 DNA Polymerase com- 
pletely eliminated en'ors in the 500 nucleotide region from nucleotides 51-550 and the first terminal region from nucle- 
otides 551-650. Furthermore, the use of F730Yrma30 DNA Polymerase extended the length of target sequencable 
with an accuracy of at least 97% by at least 100 nucleotides, from 650 nucleotides using AmpliTaq® DNA Polymerase 
FS, to at least 750 nucleotides using F730Y7ma30 DNA Polymerase. 

Example 6 

Dye Primer Cycle Sequencing 

This example demonstrates the application of the DNA polymerase of the invention to dye primer sequencing. 

Cycle sequencing reactions are performed in a buffer consisting of 25 mM Tris-HCI (pH 9.1) and 3.5 mM MgCl2. 
Four individual reactions, one for each of the four dideoxy terminators, are performed. Reaction conditions for each of 
the four reactions are described below: 

1. Dideoxy-ATP reactions (5 ^il): 

100 jiM each dATP, dCTP, and dTTP (Perkin-Elmer), 
100 ^iM c7dGTP (Pharmacia, Piscataway, NJ), 
0.5 iM ddATP (Pharmacia), 

0.1 ng M13mp 1 8 single-strand DNA template (Perkin-Elmer). 
0.4 pmol JOE Dye Primer (PerWn-Elmer), 
1 unit DNA polymerase, and 

5 units of rTth Thermostable Inorganic Pyrophosphatase. 

2. Dideoxy-CTP reactions (5 ^!): 

100 jiM each dATP, dCTP. and dTTP (Perkin-Elmer), 
100 ^M c7dGTP (Pharmacia), 
0.5 isJt^ ddCTP (Pharmacia), 

0.1 ^g M13mp18 single-strand DNA template (Perkin-Elmer). 
0.4 pmol FAM Dye Primer (Perkin-Elmer), 
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1 unit DNA polymerase, and 

5 units of rTth Thermostable Inorganic Pyrophosphatase. 

3. DIdeoxy-GTP reactions (10 ^1): 

100 each dATR dCTP, and dTTP (Perkin-Elmer), 
100 ^iM c7dGTP (Pharmacia), 
0.5 nM ddQTP (Pharmacia), 

0.2 ng M13mp 18 single-strand DNA template (Perkin-Elmer), 
0.8 pmol TAMRA Dye Primer (Perkin-Elmer), 

2 units DNA polymerase, and 

10 units o1 rTth Thermostable Inorganic Pyrophosphatase. 

4. Dideoxy-TTP reactions (10 ^l): 

100 jaM each dATP, dCTP, and dTTP (Perkin-Elmer), 
100 c7dGTP (Pharmacia). 
0.5 \M ddlTP (Pharmacia). 

0.2 M13mp18 single-strand DNAtenrplate (Perkin-Elmer), 
0.8 pmol ROX Dye Primer (Perkin-Elmer), 
2 units DNA polymerase, and 

10 units of rTth Thermostable Inorganic Pyrophosphatase. 

Each of the four reactions are placed in a preheated (75**C) Perkin-Elmer GeneAmp® PGR System 9600 thermal 
cycler and subjected to 15 cycles of 96**C for 15 seconds. 55**C for 1 second, and TO'^C for 1 minute, followed by 15 
cycles of 96°C for 15 seconds and 70°C for 1 minute. The four reactions are pooled and precipitated by the addition of 
100 95% ethanol and 2.0 ^il 3 M sodium acetate (pH 5.3) at 4'*C for 15 minutes. The pooled reaction is microcentri- 
fuged for 15 minutes to collect precipitate, the supernatant is removed, and the pellet dried. The pellet is resuspended 
in 6 \i\ of delonized formamide/50 mM EDTA (pH 8.0)5/1 (v/v), heated at 90<*C for 2 minutes, and directly loaded onto a 
pre-electrophoresed 4% poIyacrylamide/6 M urea gel and electrophoresed and analyzed on an ABI PRISM™ 377 DNA 
Sequencer (Perkin Elmer, Nonwalk, CT) according to the manufacturer instructions. 

Example 7 

Effect of Pyrophosphatase 

In the dye-terminator reactions described in Example 5, above, 20 units of rTth Thermostable Inorganic Pyrophos- 
phatase (PPase) were added to the reaction to reduce the effects of pyrophosphorolysis. This amount of PPase had 
been determined to be beneficial for reactions using AmpliTaq® DNA Polymerase FS. The following experiments were 
carried out to determine the effect of PPase concentration on the results of cycle sequencing reactions using 
F730Yrma30 DNA Polymerase, 

Dye-terminator cycle sequencing reactions were carried out essentially as described in Example 5, above, with the 
exception that the PPase concentration was varied between reactions. PPase concentrations of 0, 0.5, 1 , and 20 units 
per reaction were tested. The target DNA, pGEM-3Zf(+), and the primer used, M13(-21), were from the ABI PRISM™ 
Dye Terminator Cycle Sequencing Core Kit. from Perkin Elmer (Nonwalk, CT). All reactions were done in duplicate. 

The results of each sequencing reaction were compared by direct comparison of the sequencing traces. The 
results revealed no obvious differences between the four PPase concentrations. Sequencing trace peak heights and 
background were comparable to a read of at least 500 base pairs. Thus, the data indicate that the use of F730Yrma30 
DNA Polymerase allows cycle sequencing reactions to be carried out without added PPase. 

Example 8 

Optimal dITP Concentration 

The ABI PRISM™ Dye Terminator Cycle Sequencing Core Kit with AmpliTaq® DNA Polymerase FS (Perkin Elmer, 
Nonwalk, CT), used in Example 5. above, provides a dNTP mix containing dITP. dATP, dCTP. and dTTP in a 5:1 :1 :1 ratio. 
The increased concentration of dITP compensates for the lower dITP incorporation efficiency possessed by AmpliTaq® 
DNA Polymerase FS. An analysis of the strength of the G signal peaks generated in the cycle sequencing reactions 
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described in Example 5 suggested that F730YTma30 DNA Polymerase incorporates dITP wrtli greater efficiency and, 
consequently, the dITP concentration should be decreased. Further reactions were carried out to determine an optimal 
concentration of dITP for use in dye-terminator cycle sequencing reactions using F730YT/77a30 DNA Polymerase. 

Reactions were carried out essentially as described in Example 5, using the ABI PRISM™ Dye Terminator Cycle 
Sequencing Core Kit with AmpliTaq® DNA Polymerase FS. In place of the dNTP mix provided with the kit, dNTP mixes 
containing 1 00 \M each dATP, dCTP, and dTTP, and a range of dITP concentrations in a TE buffer (1 0 mM Tris-HCI, pH 
8,0.1 mM EDTA) were used. As described in Example 5, a F730Yr/na30 DNA Pdymerase/rTf/r Thermostable Inor- 
ganic Pyrophosphatase mixture was substituted for the AmpliTaq® DNA Polymerase FS/rTth Thermostable Inorganic 
Pyrophosphatase mixture provided with the kit. 

The optimal dITP concentration was determined by comparisons of both the sequence traces and the unprocessed 
signal strength data. Based on these experiments, it was determined that the dITP concentration is preferably lowered 
to 150-250 |iM. The results indicate that F730Yrma30 DNA Polymerase Incorporates dITP significantly more efficiently 
than does AmpliTaq® DNA Polymerase FS. Further experiments carried out comparing FTSOYTmaSO DNA Polymer- 
ase to other 

thermostable DNA polymerases (results not shown) also indicated that F730Yr/na30 DNA Polymerase possesses 
a significantly increased efficiency of dITP incorporation relative to other thermostable DNA polymerases. 

The following deposit was made on the date given: 



Strain 


ATCC No. 


Deposit Date 


pUC18;Tma25 


98443 


May 28, 1997 



This deposit was made by ROCHE MOLECULAR SYSTEMS, Inc., 1145 Atlantic Avenue, Alameda. California 
94501, U.S.A., at the American Type Cutture Collection (ATCC), 12301 ParWawn Drive, Rockville, MD 20852. U.S.A. 
under the provisions of the Budapest Treaty on the International Recognition of the Deposit of Microorganisms for the 
Purposes of Patent Procedure and the Regulations thereunder (Budapest Treaty). This assures maintenance of a via- 
ble culture for 30 years from date of deposit. The organism will be made available by ATCC under the terms of the Buda- 
pest Treaty, and subject to an agreement between aii^licants and ATCC, which assures permanent and unrestricted 
availability of the progeny of the cultures to the public upon issuance of the pertinent U.S. patent or upon laying open to 
the public of any U.S. or foreign patent application, whichever comes first, and assures availability of the progeny to one 
determined by the U.S. Commissioner of Patents and Trademarks to be entitled thereto according to 35 U.S.C. §122 
and the Commissioner's rules pursuant thereto (including 37 C.F.R. §1.14 with particular reference to 886 OQ 638). The 
assignee of the present application agrees that if the culture on deposit should die or be lost or destroyed when culti- 
vated under suitable conditions, it will be promptly replaced on notification with a viable specimen of the same culture. 
Availability of the deposited strain is not to be construed as a license to practice the invention in contravention of the 
rights granted under the authority of any government in accordance with its patent laws. 

ROCHE MOLECULAR SYSTEMS, Inc.. 1145 Atlantic Avenue, Alameda, California 94501. U.S.A has authorized F 
HOFFMANN-LA ROCHE AG, 124 Grenzacherstrasse, CH-4070 Basle, Switzerland, to refer to the aforementioned 
deposited biological material in foreign patent applications claiming priority from U.S. Patent Application Ser. No. 60- 
023376 and has given the unresen^ed and irrevocable consent that the deposited material is made available to the pub- 
lic. 
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SEQUENCE LISTING 



(1) GENERAL INFORMATION: 

5 

(i) APPLICANT: 

(A) NAME: F .Hoffmann -La Roche Ltd 
{B) STREET: Grenzacherscrasse 124 

(C) CITY: Basel 

(D) STATE: BS 

(E) COONTRY: Switzerland 

10 (F) POSTAL CODE (ZIP) : CH-4070 

(G) TELEPHONE: (0)61 688 24 03 
(K) TELEFAX: (0|61 688 13 95 
(I) TELEX: 962292/965512 hlr ch 

(ii) TITLE OF INVENTION: Mutant chimeric DNA polymerases 

15 (iii) NUMBER OP SEQUENCES: 11 

(iv) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy diak 

(B) COMPUTER: IBM PC coit^iacible 

(CI OPERATING SYSTEM: PC-DOS /MS-DOS 

(D) SOFTWARE: Patentin Release #1.0, Version #1.30 (EPO) 

20 

ivi) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: US 50/052,065 

(B) FILING DATE: 09-jaL-1997 



(2) INFORMATION FOR SEQ ID N0:1: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 291 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNB5S : single 

(D) TOPOLOGY: linear 

(li) MOLECULE TYPE; protein 



(xi) SEQUENCE DESCRIPTION; SEQ ID N0:1; 

^ Met Ala Arg Leu Phe Leu Phe Agp Gly Thr Ala Leu Ala Tyr Arg Ala 

15 10 15 

Tyr Tyr Ala Leu Asp Arg Ser Leu Ser Thr Ser Thr Gly He Pro Thr 
20 25 30 



40 



45 



Asn Ala Thr Tyr Gly Val Ala Arg Met Leu Val Arg Phe He Lys Asp 
35 40 45 

His He He Val Gly Lys Asp Tyr Val Ala Val Ala Phe Asp Lys Lys 

50 55 60 . 

Ala Ala Thr Phe Arg His Lys Leu Leu Glu Thr Tyr Lys Ala Gin Arg 
65 70 75 so 

Pro Lys Thr Pro Asp Leu Leu He Gin Gin Leu Pro Tyr He Lys Lys 
85 90 95 

Leu Val Glu Ala Leu Gly Met Lys Val Leu Glu Val Glu Gly Tyr Glu 
100 105 HO 

50 Ala Asp Asp He He Ala Thr Leu Ala Val Lys Gly Leu Pro Leu Phe 

115 120 125 

Asp Glu He Phe He Val Thr Gly Asp Lys Asp Met Leu Gin Leu Val 
130 135 140 

55 
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Asn Glu Lys lie Lys Val Trp Arg lie Val Lys Gly lie Ser Asp Leu 
145 150 155 160 

Glu Leu Tyr Asp Ala Gin Lys Val Lys Glu Lys Tyr Gly Val Glu Pro 
165 170 175 

Gin Gin lie Pro Asp Leu Leu Ala Leu Thr Gly Asp Glu lie Asp Asn 
180 185 190 

He Pro Gly Val Thr Gly He Gly Glu Lys Thr Ala Val Gin Leu Leu 
195 200 205 

Glu Lys Tyr Lys Asp Leu Glu Asp He Leu Asn His Val Arg Glu Leu 
210 215 220 

Pro Gin Lys Val Arg Lys Ala Leu Leu Arg Asp Arg Glu Asn Ala He 
225 230 235 240 

Leu Ser Lys Lys Leu Ala He Leu Glu Thr Asn Val Pro He Glu He 
245 250 255 

Asn Trp Glu Glu Leu Arg Tyr Gin Gly Tyr Asp Arg Glu Lys Leu Leu 
260 265 270 

Pro Leu Leu Lys Glu Leu Glu Phe Ala Ser He Met Lys Glu Leu Gin 
275 280 285 

Leu Tyr Glu 
290 



INFORMATION PGR SEQ ID NO: 2: 

(i) SEQOENCE CHARACTERISTICS: 

(A) LENGTH: 289 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNES5 : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ 10 NO: 2: 

Met Arg Gly Met Leu Pro Leu Phe Glu Pro Lys Gly Arg Val Leu Leu 
15 10 IS 

Val Asp Gly His His Leu Ala Tyr Arg Thr Phe His Ala Leu Lys Gly 
20 25 30 

Leu Thr Thr Ser Arg Gly Glu Pro Val Gin Ala Val Tyr Gly Phe Ala 
35 dO 45 

Lys Ser Leu Leu Lys Ala Leu Lys Glu Asp Gly Asp Ala Val He Val 
50 55 60 

Val Phe Asp Ala Lys Ala Pro Ser Phe Arg His Glu Ala Tyr Gly Gly 
65 70 75 80 

Tyr Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro Arg Gin Leu 
85 90 95 

Ala Leu He Lys Glu Leu Val Asp Leu Leu Gly Leu Ala Arg Leu Glu 
100 105 110 

Val Pro Gly Tyr Glu Ala Asp Asp Val Leu Ala Ser Leu Ala Lys Lys 



115 



120 



125 



Ala 



Glu Lys Glu 
130. 



Gly Tyr Glu Val 
135 



Arg He Leu Thr Ala Asp Lys Asp 
.140 
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10 



Leu Tyr Gin Leu Leu Ser Asp Arg lie His Val Leu His Pro Glu Gly 

145 150 155 160 

Tyr Leu lie Thr Pro Ala Trp Leu Trp Glu Lys Tyr Gly Leu Arg Pro 
165 170 175 

Aap Gin Trp Ala Asp Tyr Arg Ala Leu Thr Gly Asp Glu Ser Asp Asn 
180 185 190 

Leu Pro Gly Val Lys Gly lie Gly Glu Lys Thr Ala Arg Lys Leu Leu 
195 200 205 

Glu Glu Trp Gly Ser Leu Glu Ala Leu Leu Lys Asn Leu Asp Arg Leu 
210 215 220 

Lys Pro Ala lie Arg Glu Lys lie Leu Ala His Met Asp Asp Leu Lys 

225 230 235 240 



15 



Leu Ser Trp Asp Leu Ala Lys Val Arg Thr Asp Leu Pro Leu Glu Val 
245 250 255 



Asp Phe Ala Lys Arg Arg Glu Pro Asp Arg Glu Arg Leu Arg Ala Phe 
260 265 270 



20 



Leu Glu Arg Leu Glu Phe Gly Ser Leu Leu His Glu Phe Gly Leu Leu 
275 280 285 



Glu 



(2) rNFORMATION FOR SEQ ID N0:3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 288 amino acids 

(B) TYPE: aaiino acid 

(C) STRAB3DEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 

Met Ala Met Leu Pro Leu Phe Glu Pro Lys Gly Arg Val Leu Leu Val 
^ 15 10 15 

Asp Gly His His Leu Ala Tyr Arg Thr Phe Phe Ala Leu Lys Gly Leu 
20 25 30 



40 



45 



50 



Thr Thr Ser Arg Gly Glu Pro Val Gin Ala Val Tyr Gly Phe Ala Lys 
35 40 45 

Ser Leu Leu Lys Ala Leu Lys Glu Asp Gly Asp Val Val Val Val Val 
50 55 60 

Phe Asp Ala Lys Ala Pro Ser Phe Arg His Glu Ala Tyr Glu Ala Tyr 
65 70 75 80 

Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro Arg Gin Leu Ala 
85 90 95 

Leu He Lys Glu Leu Val Asp Leu Leu Gly Leu Val Arg Leu Glu Val 
100 105 110 

Pro Gly Phe Glu Ala Asp Asp Val Leu Ala Thr Leu Ala Lys Arg Ala 
115 120 . 125 

Glu Lys Glu Gly Tyr Glu Val Arg He Leu Thr Ala Asp Arg Asp Leu 
130 135 140 



55 
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Tyr Gin Leu Leu Ser Glu Arg He Ala He Leu His Pro Glu Gly Tyr 
145 150 155 160 

Leu He Thr Pro Ala Trp Leu Tyr Glu Lys Tyr Gly Leu Arg Pro Glu 
165 170 175 

Gin Trp Val Asp Tyr Arg Ala Leu Ala Gly Asp Pro Ser Asp Asn He 
180 185 190 

Pro Gly Val Lys Gly He Gly Glu Lys Thr Ala Gin Arg Leu He Arg 
195 200 205 

Glu Trp Gly Ser Leu Glu Asn Leu Phe Gin His Leu Asp Gin Val Lys 
210 215 220 

Pro Ser Leu Arg Glu Lys Leu Gin Ala Gly Met Glu Ala Leu Ala Leu 
225 230 235 240 

Ser Arg Lys Leu Ser Gin Val His Thr Asp Leu Pro Leu Glu Val Asp 
245 250 255 

Phe Gly Arg Arg Arg Thr Pro Asn Leu Glu Gly Leu Arg Ala Phe Leu 
260 265 270 

Glu Arg Leu Glu Phe Gly Ser Leu Leu His Glu Phe Gly Leu Leu Glu 
20 275 280 285 

I 

(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 291 amino acids 
2^ (B) TYPE: amino acid 

(C) STRANDEDNES5: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



10 



15 



30 



35 



40 



45 



50 



55 



txi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 

Met Glu Ala Met Leu Pro Leu Phe Glu Pro Lys Gly Arg Val Leu Leu 
1 5 10 15 

Val Asp Gly His His Leu Ala Tyr Arg Thr Phe Phe Ala Leu Lys Gly 
20 25 30 

Leu Thr Thr Ser Arg Gly Glu Pro Val Gin Ala Val Tyr Gly Phe Ala 
35 40 45 

Lys Ser Leu Leu Lys Ala Leu Lys Glu Asp Gly Tyr Lys Ala Val Phe 
50 55 60 

Val Val Phe Asp Ala Lys Ala Pro Ser Phe Arg His Glu Ala Tyr Glu 

65 70 75 ^ 80 

Ala Tyr Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro Arg Gin 
85 90 95 

Leu Ala Leu He Lys Glu Leu Val Asp Leu Leu Gly Phe Thr Arg Leu 
100 105 110 

Glu Val Pro Gly Tyr Glu Ala Asp Asp Val Leu Ala Thr Leu Ala Lys 
115 120 125 

Lys Ala Glu Lys Glu Gly Tyr Glu Val Arg He Leu Thr Ala Asp Arg 
130 135 .140 

Asp Leu Tyr Gin Leu Val Ser Asp Arg Val Ala Val Leu His Pro Glu 
145 150 155 . ■ .160 
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Gly His Leu He Thr Pro Glu Trp Leu Trp Glu Lys Tyr Gly Leu Arg 
165 170 175 

I 

Pro Glu Gin Trp Val Asp Phe Axg Ala Leu Val Gly Asp Pro Ser Asp 
180 185 190 

Asn Leu* Pro Gly Val Lys Gly He GLy Glu Lys Thr Ala Leu Ly3 Leu 
195 200 205 

Leu Ly3 Glu Trp Gly Ser Leu Qlu Asn Leu Leu Lys Asn Leu Asp Arg 
210 215 220 

Val Lys Pro Glu Asn Val Arg Glu Lys He Lys Ala His Leu Glu Asp 
225 230 235 240 

Leu Axg Leu Ser Leu Glu Leu Ser Arg Val Arg Thr Asp Leu Pro Leu 
245 250 255 

Glu Val Asp Leu Ala Gin Gly Arg Glu Pro Asp Arg Glu Gly Leu Arg 
260 265 270 

Ala Phe Leu Glu Arg Leu Glu Phe Gly Ser Leu Leu His Glu Phe Gly 
275 280 285 

Leu Leu Glu 
290 



(2) INFORMATION FOR SEQ ID N0:5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 291 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLEan.E TYPE: protein 



(xil SEQUENCE DESCRIPTION: SEQ ID NO: 5: 

Met Lys Ala Met Leu Pro Leu Phe Glu Pro Lys Gly Arg Val Leu Leu 
1 5 10 15 

Val Asp Gly His His Leu Ala Tyr Arg Thr Phe Phe Ala Leu Lys Gly 
20 25 30 

Leu Thr Thr Ser Arg Gly Glu Pro Val Gin Ala Val Tyr Gly Phe Ala 
35 40 45 

Lys Ser Leu Leu Lys Ala Leu Lys Glu Asp Gly Tyr Lys Ala Val Phe 

50 55 60 

Val Val Phe Asp Ala Lys Ala Pro Ser Phe Arg His Glu Ala Tyr Glu 
65 70 75 80 

Ala Tyr Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro Pro Gin 
85 90 95 

Leu Ala Leu He Lys Glu Leu Val Asp Leu Leu Gly Phe Thr Arg Leu 
100 105 110 

Glu Val Pro Gly Phe Glu Ala Asp Asp Val Leu Ala Thr Leu Ala Lys 
115 120 125 

Lys Ala Glu Arg Glu Gly Tyr Glu Val Arg He Leu Thr Ala Asp Arg 
130 135 140 

Asp Leu Tyr Gin Leu Val Ser Asp Arg Val Ala Val Leu His Pro Glu 
145 150 155 . 160 
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Gly His Leu lie Thr Pro Glu Trp Leu Trp Glu Lys Tyr Gly Leu Lys 
165 170 175 

Pro Glu Gin Trp Val Asp Phe Arg Ala Leu val Gly Asp Pro Ser Asp 
100 185 190 

Asn Leu Pro Gly Val Lys Gly He Gly Glu Lys Thr Ala Leu Lys Leu 
195 200 205 

Leu Lys Glu Trp Gly Ser Leu Glu Asn He Leu Lys Asn Leu Asp Arg 
210 215 220 

Val Lys Pro Glu Ser Val Arg Glu Arg He Lys Ala His Leu Glu Asp 
225 230 235 240 

Leu Lys Leu Ser Leu Glu Leu Ser Arg Val Arg Ser Asp Leu Pro Leu 
245 250 255 

15 Glu Val Asp Phe Ala Arg Arg Arg Glu Pro Asp Arg Glu Gly Leu Arg 

260 265 270 

Ala Phe Leu Glu Arg Leu Glu Phe Gly Ser Leu Leu His Glu Phe Gly 
275 280 285 

Leu Leu Glu 
20 290 

U) IKFORMATION FOR SEQ ID NO: 6: 

ii) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 291 amino acids 
25 (B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

30 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 

Met Glu Ala Met Leu Pro Leu Phe Glu Pro Lys Gly Arg Val Leu Leu 
1 5 10 15 

Val Asp Gly His His Leu Ala Tyr Arg Thr Phe Phe Ala Leu Lys Gly 

3S 20 25 30 

Leu Thr Thr Ser Arg Gly Glu Pro Val Gin Ala Val Tyr Gly Phe Ala 
35 40 45 

Lys Ser Leu Leu Lys Ala Leu Lys Glu Asp Gly Tyr Lys Ala Val Phe 
50 55 60 

Val Val Phe Asp Ala Lys Ala Pro Ser Phe Arg His Glu Ala Tyr Glu 

65 70 75 • 80 

Ala Tyr Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro Arg Gin 
85 90 . 95 

Leu Ala Leu He Lys Glu Leu Val Asp Leu Leu Gly Phe Thr Arg Leu 
100 105 110 

Glu Val Pro Gly Tyr Glu Ala Asp Asp Val Leu Ala Thr Leu Ala Lys 
115 120 125 

Asn Pro Glu Lys Glu Gly Tyr Glu Val Arg He Leu Thr Ala Asp Arg 
130 135 140 

Asp Leu Asp Gin Leu Val Ser Asp Arg Val Ala Val Leu His Pro Glu 
145 150 155 . 160 



55 



40 



45 



SO 



31 

03/15/2002, EAST Version: 1.03.0002 



EP0 892 058 A2 



Gly His Leu lie Thr Pro Glu Trp Leu Trp Gin Lys Tyr Gly Leu Lys 

165 170 175 

Pro Glu Gin Trp Val Asp Phe Arg Ala Leu Val Gly Asp Pro Ser Asp 
180 185 190 

Asn Leu Pro Gly Val Lys Gly He Gly Glu Lys Thr Ala Leu Lys Leu 
195 200 205 



10 



Leu Lys Glu Trp Gly Ser Leu Glu Asn Leu Leu Lys Asn Leu Asp Arg 
210 215 220 

Val Lys Pro Glu Asn Val Arg Glu Lys He Lys Ala His Leu Glu Asp 
225 230 235 240 



Leu Arg Leu Ser Leu Glu Leu Ser Arg Val Arg Thr Asp Leu Pro Leu 
245 250 255 



15 



Glu Val Asp Leu Ala Gin Gly Arg Glu Pro Asp Arg Glu Gly Leu Arg 
260 265 270 



Ala Phe Leu Glu Arg Leu Glu Phe Gly Ser Leu Leu His Glu Phe Gly 
275 280 285 



20 



Leu Leu Glu 
290 



(2) INFORMATION FOR SEQ ID NO: 7: 

U) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 287 amino acids 
25 (B) TYPE: amino acid 

(C) STRANDEEHESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



30 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 

Met Leu Pro Leu Phe Glu Pro Lys Gly Arg Val Leu Leu Val Asp Gly 
1 5 10 15 

His His Leu Ala Tyr Arg Thr Phe Phe Ala Leu Lys Gly Leu Thr Thr 
35 20 25 30 

Ser Arg Gly Glu Pro Val Gin Ala Val Tyr Gly Phe Ala Lys Ser Leu 
35 40 45 



40 



45 



Leu Lys Ala Leu Lys Glu Asp Gly Glu Val Ala He Val Val Phe Asp 
50 55 60 

Ala Lys Ala Pro Ser Phe Arg His Glu Ala Tyr Glu Ala Tyr Lys Ala 
65 70 75 80 

Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro Arg Gin Leu Ala Leu He 
85 90 . 95 

Lys Glu Leu Val Asp Leu Leu Gly Leu Val Arg Leu Glu Val Pro Gly 
100 105 110 

Phe Glu Ala Asp Asp Val Leu Ala Thr Leu Ala Lys Lys Ala Glu Arg 
115 120 125 

Glu Gly Tyr Glu Val Arg He Leu Ser Ala Asp Arg Asp Leu Tyr Gin 
130 135 140 

Leu Leu Ser Asp Arg He His Leu Leu His Pro Glu Gly Glu Val Leu 
145 150 155 . 160 



55 
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Thr Pro Gly Trp Leu Gin Glu Arg Tyr Gly Leu Ser Pro Qlu Arg Trp 
165 170 175 

Val Glu Tyr Arg Ala Leu Val Gly Asp Pro Ser Asp Asn Leu Pro Gly 
180 185 190 

Val Pro Gly lie Gly Glu Lys Thr Ala Leu Lys Leu Leu Lys Glu Trp 
195 200 205 

Gly Ser Leu Glu Ala lie Leu Lys Asn Leu Asp Gin Val Lys Pro Glu 
210 215 220 

Arg Val Arg Glu Ala He Arg Asn Asn Leu Asp Lys Leu Gin Met Ser 
225 230 235 240 

Leu Glu Leu Ser Arg Leu Arg Thr Asp Leu Pro Leu Glu Val Asp Phe 
245 250 255 

Ala Lys Arg Arg Glu Pro Asp Trp Glu Gly Leu Lys Ala Phe Leu Glu 
260 265 270 

Arg Leu Glu Phe Gly Ser Leu Leu His Glu Phe Gly Leu Leu Glu 
275 280 285 

(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 287 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



25 



30 



35 



40 



45 



SO 



ss 



(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 

Met Leu Pro Leu Leu Glu Pro Lys Gly Arg Val Leu Leu Val Asp Gly 
15 10 15 

His His Leu Ala Tyr Arg Thr Phe Phe Ala Leu Lys Gly Leu Thr Thr 
20 25 30 

Ser Arg Gly Glu Pro Val Gin Ala Val Tyr Gly Phe Ala Lys Ser Leu 
35 40 45 

Leu Lys Ala Leu Lys Glu Asp Gly Glu Val Ala He Val Val Phe Asp 
50 55 60 

Ala Lys Ala Pro Ser Phe Arg His Glu Ala Tyr Glu Ala Tyr Lys Ala 
65 70 75 80 

Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro Arg Gin Leu Ala Leu He 
85 90 95 

Lys Glu Leu Val Asp Leu Leu Gly Leu Val Arg Leu Glu Val Pro Gly 
100 105 110 

Phe Glu Ala Asp Asp Val Leu Ala Thr Leu Ala Arg Lys Ala Glu Arg 
115 120 125 

Glu Gly Tyr Glu Val Arg He Leu Ser Ala Asp Arg Asp Leu Tyr Gin 
130 135 140 

Leu Leu Ser Asp Arg He His Leu Leu His Pro Glu Gly Glu Val Leu 
145 150 155 160 

Thr Pro Gly Trp Leu Gin Glu Arg Tyr Gly Leu Ser Pro Glu Arg Trp 
165 170 175 
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Val Glu Tyr Arg Ala Leu Val Gly Asp Pro Ser Asp Aan Leu Pro Gly 
180 185 190 

Val Pro Gly lie Gly Glu Lys Thr Ala Leu Lys Leu Leu Lys Glu Trp 
195 200 205 

Gly Ser Leu Glu Ala lie Leu Lys Asn Leu Asp Gin Val Lys Pro Glu 
210 215 220 

Arg Val Trp Glu Ala lie Arg Asn Asn Leu Asp Lys Leu Gin Met Ser 
225 230 235 240 

Leu Glu Leu Ser Arg Leu Arg Thr Asp Leu Pro Leu Glu Val Asp Phe 
245 250 255 

Ala Lys Arg Arg Glu Pro Asp Trp Glu Gly Leu Lys Ala Phe Leu Glu 
260 265 270 

IS Arg Leu Glu Phe Gly Ser Leu Leu His Glu Phe Gly Leu Leu Glu 

275 280 285 

(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 
20 (A) LENGTH: 2682 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDBDNESS: single 
iD) TOPOLOGY: linear 



2S 



30 



50 



(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 

ATGAGAGGCA TGCTTCCACT TTTTGAGCCC AAGGGCCGGG TCCTCCTGGT GGACGGCCAC 60 

CACCTGGCCT ACCGCACCTT CCACGCCCTG AAGGGCCTCA CCACCAGCCG GGGGGAGCCG 120 

GTGCAGGCGG TCTACGACTT CGCCAAGAGC CTCCTCAAGG CCCTCAAGGA GGACGGGGAC 180 

GCGGTGATCG TGGTCTTTGA CGCCAAGGCC CCCTCCTTCC GCCACGAGGC CTACGGTGGG 240 

TACAAGGCGG GCCGGGCCCC CACGCCGGAG GACTTTCCCC GGCAACTCGC CCTCATCAAG 300 

35 GAGCTGGTAG ATCTCCTGGG GCTGGCGCGC CTCGAGGTCC CGGGCTACGA GGCGGACGAC 360 

GTCCTGGCCA GCCTGGCCAA GAAGGCGGAA AAGGAGGGCT ACGAGGTCCG CATCCTCACC 420 

GCCGACAAAG ACCTTTACCA GCTCCTTTCC GACCGCATCC ACGTCCTCCA CCCCGAGGGG 480 

TACCTCATCA CCCCGGCCTG GCTTTGGGAA AAGTACGGCC TGAGGCCCGA CCAGTGGGCC 540 

40 

GACTACCGGG CCCTGACCGG GGACGAGTCC GACAACATCC CCGGGGTCAC TGGGATCGGT 600 

GAGAAGACTG CTGTTCAGCT TCTAGAGAAG TACAAAGACC TCGAAGACAT ACTGAATCAT - 660 

GTTCGCGAAC TTCCTCAAAA GGTGAGAAAA GCCCTGCTTC GAGACAGAGA AAACGCCATT 720 

45 CTCAGCAAAA AGCTGGCGAT TCTGGAAACA AACGTTCCCA TTGAAATAAA CTGGGAAGAA 780 

CTTCGCTACC AGGGCTACGA CAGAGAGAAA CTCTTACCAC TTTTGAAAGA ACTGGAATTC 840 

GCATCCATCA TGAAGGAACT TCAACTGTAC GAAGAGTCCG AACCCGTTGG ATACAGAATA 900 

GTGAAAGACC TAGTGGAATT TGAAAAACTC ATAGAGAAAC TGAGAGAATC CCCTTCGTTC 960 

GCCATAGATC TTGAGACGTC TTCCCTCGAT CCTTTCGAGT GCGACATTGT CGGTATCTCT 1020 

GTGTCTTTCA AACCAAAGGA AGCGTACTAC ATACCACTCC ATCATAGAAA CGCCCAGAAC 10 80 



55 
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10 



15 



20 



25 



30 



35 



40 



SO 



55 



CTGGACGAAA 


AAGAGGTTCT 


GAAAAAGCTC 


AAAGAAATTC 


TGGAGGACCC 


CGGAGCAAAG 


1140 


ATCGTTGGTC 


AGAATTTGAA 


ATTCGATTAC 


AAGGTGTTGA 


TGGTGAAGGG 


TGTTGAACCT 


1200 


GTTCCTCCTT 


ACTTCGACAC 


GATGATAGCG 


GCTTACCTTC 


TTGAGCCGAA 


CGAAAAGAAG 


1260 


TTCAATCTGG 


ACGATCTCGC 


ATTGAAATTT 


CTTGGATACA 


AAATGACATC 


TTACCAAGAG 


1320 


CTCATGTCCT 


TCTCTTTTCC 


GCTGTTTGGT 


TTCAGTTTTG 


CCGATGTTCC 


TGTAGAAAAA 


1380 


GCAGCGAACT 


ACTCCTGTGA 


AGATGCAGAC 


ATCACCTACA 


GACTTTACAA 


GACCCTGAGC 


1440 


TTAAAACTCC 


ACGAGGCAGA 


TCTGGAAAAC 


GTGTTCTACA 


AGATAGAAAT 


GCCCCTTGTG 


1500 


AACGTGCTTG 


CACGGATGGA 


ACTGAACGGT 


GTGTATGTGG 


ACACAGAGTT 


CCTGAAGAAA 


1560 


CTCTCAGAAG 


AGTACGGAAA 


AAAACTCGAA 


GAACTGGCAG 


AGGAAATATA 


CAGGATAGCT 


1620 


GGAGAGCCGT 


TCAACATAAA 


CTCACCGAAG 


CAGGTTTCAA 


GGATCCTTTT 


TGAAAAACTC 


1680 


GGCATAAAAC 


CACGTGGTAA 


AACGACGAAA 


ACGGGAGACT 


ATTCAACACG 


CATAGAAGTC 


1740 


CTCGAGGAAC 


TTGCCGGTGA 


ACACGAAATC 


ATTCCTCTGA 


TTCTTGAATA 


CAGAAAGATA 


1800 


CAGAAATTGA 


AATCAACCTA 


CATAGACGCT 


CTTCCCAAGA 


TGGTCAACCC 


AAAGACCGGA 


1860 


AGOATTCATG 


CTTCTTTCAA 


TCAAACGGGG 


ACTGCCACTG 


GAAGACTTAG 


CAGCAGCGAT 


1920 


CCCAATCTTC 


AGAACCTCCC 


GACGAAAAGT 


GAAGAGGGAA 


AAGAAATCAG 


GAAAGCGATA 


1980 


GTTCCTCAGG 


ATCCAAACTG 


GTGGATCGTC 


AGTGCCGACT 


ACTCCCAAAT 


AGAACTGAGG 


2040 


ATCCTCGCCC 


ATCTCAGTGG 


TGATGAGAAT 


CTTTTGAGGG 


CATTCGAAGA GGGCATCGAC 


2100 


GTCCACACTC 


TAACAGCTTC 


CAGAATATTC 


AACGTGAAAC 


CCGAAGAAGT AACCGAAGAA 


2160 


ATGCGCCGCG 


CTGGTAAAAT 


GGTTAATTTT 


TCCATCATAT 


ACGGTGTAAC ACCTTACGGT 


2220 


CTGTCTGTGA 


GGCTTGGAGT 


ACCTGTGAAA 


GAAGCAGAAA 


AGATGATCGT 


CAACTACTTC 


2280 


GTCCTCTACC 


CAAAGGTGCG 


CGATTACATT 


CAGAGGGTCG 


TATCGGAAGC 


GAAAGAAAAA 


2340 


GGCTATGTTA 


GAACGCTGTT 


TGGAAGAAAA 


AGAGACATAC 


CACAGCTCAT GGCCCGGGAC 


2400 


AGGAACACAC 


AGGCTGAAGG 


AGAACGAATT 


GCCATAAACA 


CTCCCATACA GGGTACAGCA 


2460 


GCGGATATAA 


TAAAGCTGGC 


TATGATAGAA 


ATAGACAGGG 


AACTGAAAGA AAGAAAAATG 


2520 


AGATCGAAGA 


TGATCATACA 


GGTCCACGAC 


GAACTGGTTT 


TTGAAGTGCC CAATGAGGAA 


2580 


AAGGACGCGC 


TCGTCGAGCT 


GGTGAAAGAC 


AGAATGACGA 


ATGTGGTAAA 


GCTTTCAGTG 


2640 


CCGCTCGAAG 


TGGATGTAAC 


CATCGGCAAA 


ACATGGTCGT 


GA 




2682 



(2) INFORMATION FOR SEQ ID NO: 10: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 893 amino acids 

(B) TVFE: amino acid 

(C) STRANDEDNESS : single 
45 (Dl TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: . 

Met Ala Arg Leu Phe Leu Phe Asp Gly Thr Ala Leu Ala Tyr Arg Ala 

1 ' . . 5 . 10 ■ 15^ , 
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Tyr Tyr Ala Leu Asp Arg Ser Leu Ser Thr Ser Thr Gly lie Pro Thr 
20 25 30 

Aan Ala Thr Tyr Gly Val Ala Arg Met Leu Val Arg Phe lie Lys Asp 
35 40 45 

Hig lie lie Val Gly Lys Asp Tyr Val Ala Val Ala Phe Asp Lys Lys 
50 55 60 

Ala Ala Thr Phe Arg His Lys Leu Leu Glu Thr Tyr Lys Ala Gin Arg 
65 70 75 80 

Pro Lys Thr Pro Asp Leu Leu He Gin Gin Leu Pro Tyr He Lys Lya 
85 90 95 



15 



Leu Val Glu Ala Leu Gly Met Lys Val Leu Glu Val Glu Gly Tyr Glu 
100 105 110 

Ala Asp Asp He He Ala Thr Leu Ala Val Lys Gly Leu Pro Leu Phe 
115 120 125 



Asp Glu He Phe He Val Thr Gly Asp Lys Asp Met Leu Gin Leu Val 
130 135 140 



20 



Asn Glu Lys He Lys Val Trp Arg He Val Lys Gly He Ser Asp Leu 
145 150 155 160 



25 



Glu Leu Tyr Asp Ala Gin Lys Val Lys Glu Lys Tyr Gly Val Glu Pro 
165 170 175 

Gin Gin He Pro Asp Leu Leu Ala Leu Thr Gly Asp Glu He Asp Asn 
180 185 190 

He Pro Gly Val Thr Gly He Gly Glu Lys Thr Ala Val Gin Leu Leu 
195 200 205 



30 



Glu Lys Tyr Lys Asp Leu Glu Asp He Leu Asn His Val Arg Glu Leu 
210 215 220 

Pro Gin Lys Val Arg Lys Ala Leu Leu Arg Asp Arg Glu Asn Ala He 
225 230 235 240 



Leu Ser Lys Lys Leu Ala He Leu Glu Thr Asn Val Pro He Glu He 
245 250 255 



35 



Asn Trp Glu Glu Leu Arg Tyr Gin Gly Tyr Asp Arg Glu Lys Leu Leu 
260 265 270 



40 



Pro Leu Leu Lys Glu Leu Glu Phe Ala Ser He Met Lys Glu Leu Gin 

275 280 285 

Leu Tyr Glu Glu Ser Glu Pro Val Gly Tyr Arg He Val Lys Asp Leu 
290 295 300 

Val Glu Phe Glu Lys Leu He Glu Lys Leu Arg Glu Ser Pro Ser Phe 

305 310 315 320 



45 



Ala He Asp Leu Glu Thr Ser Ser Leu Asp Pro Phe Asp Cys Asp He 
325 330 335 

Val Gly He Ser Val Ser Phe Lys Pro Lys Glu Ala Tyr Tyr He Pro 
340 345 350 



Leu His His Arg Asn Ala Gin Asn Leu Asp Glu Lys Glu Val Leu Lys 
355 360 .365 



SO 



Lys Leu Lys Glu He Leu Glu Asp Pro Gly Ala Lys He Val Gly Gin 
370 375 380 



Asn Leu Lys Phe Asp Tyr Lys Val Leu Met Val Lys Gly Val Glu Pro 
385 390 395 ' .400 



55 
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Val Pro Pro Tyr Phe Asp Thr Met He Ala Ala Tyr Leu Leu Glu Pro 
405 410 415 

Asn Glu Lys Lys Phe Asn Leu Asp Asp Leu Ala Leu Lys Phe Leu Gly 
420 425 430 

Tyr Lys Met Thr Ser Tyr Gin Glu Leu Met Ser Phe Ser Phe Pro Leu 
435 440 445 

Phe Gly Phe Ser Phe Ala Asp Val Pro Val Glu Lys Ala Ala Asn Tyr 
450 455 460 

Ser Cys Glu Asp Ala Asp He Thr Tyr Arg Leu Tyr Lys Thr Leu Ser 
465 470 475 480 

Leu Lys Leu His Glu Ala Asp Leu Glu Asn Val Phe Tyr Lys He Glu 
485 490 495 

Met Pro Leu Val Asn Val Leu Ala Arg Met Glu Leu Asn Gly Val Tyr 
500 505 510 

Val Asp Thr Glu Phe Leu Lys Lys Leu Ser Glu Glu Tyr Gly Lys Lys 
515 520 525 

Leu Glu Glu Leu Ala Glu Glu lie Tyr Arg He Ala Gly Glu Pro Phe 
530 53S 540 

Asn He Asn Ser Pro Lys Gin Val Ser Arg He Leu Phe Glu Lys Leu 
545 550 555 560 

Gly He Lys Pro Arg Gly Lys Thr Thr Lys Thr Gly Asp Tyr Ser Thr 
565 570 575 

Arg He Glu Val Leu Glu Glu Leu Ala Gly Glu His Glu He He Pro 
530 585 590 

Leu He Leu Glu Tyr Arg Lys He Gin Lys Leu Lys Ser Thr Tyr He 
595 600 605 

Asp Ala Leu Pro Lys Met Val Asn Pro Lys Thr Gly Arg He His Ala 
610 615 620 

Ser Phe Asn Gin Thr Gly Thr Ala Thr Gly Arg Leu Ser Ser Ser Asp 
625 630 635 640 

Pro Asn Leu Gin Asn Leu Pro Thr Lys Ser Glu Glu Gly Lys Glu He 
645 650 655 

Arg Lys Ala He Val Pro Gin Asp Pro Asn Trp Trp He Val Ser Ala 
660 665 670 

Asp Tyr Ser Gin He Glu Leu Arg He Leu Ala His Leu Ser Gly Asp 
675 680 685 

Glu Asn Leu Leu Arg Ala Phe Glu Glu Gly He Asp Val His Thr Leu 
690 695 700 

Thr Ala Ser Arg He Phe Asa Val Lys Pro Glu Glu Val Thr Glu Glu 
705 710 715 720 

Met Arg Arg Ala Gly Lys Met Val Asn Phe Ser He He Tyr Gly Val 
725 730 , 735 

Thr Pro Tyr Gly Leu Ser Val Arg Leu Gly Val Pro Val Lys Glu Ala 
740 745 750 

Glu Lys Met He Val Asn Tyr Phe Val Leu Tyr Pro Lys Val Arg Asp 
755 760 765 



Tyr He Gin Arg Val Val Ser Glu Ala Lys Glu Lys Gly Tyr Val Arg 
770 775 780 
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Thr Leu Phe Gly Arg Lys Arg Asp He Pro Gin Leu Met Ala Arg Asp 
785 790 795 800 



Arg Asn Thr Gin Ala Glu Gly Glu Arg He Ala He Asn Thr 
805 810 



Pro lie 
815 



Gin Gly Thr Ala Ala Asp He He Lys Leu Ala Met He Glu He Asp 

820 825 830 

Arg Glu Leu Lys Glu Arg Lys Met Arg Ser Lys Met He lie Gin Val 
835 840 845 

His Asp Glu Leu Val Phe Glu Val Pro Asn Glu Glu Lys Asp Ala Leu 
850 855 860 



Val Glu Leu Val Lys Asp Arg Met Thr Asn Val Val Lys Leu 
865 870 875 

Pro Leu Glu Val Asp Val Thr He Gly Lys Thr Trp Ser 
885 890 



Ser Val 
880 



(2) INFORMATION FOR SEQ ID NO; 11: 

ti) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
(D| TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11; 

GGGAAGGGCG ATCGGTGCGG GCCTCTTCGC 30 



Claims 

1 . A thermostable DNA polymerase consisting of an N-terminal region and a C-terminal region, wherein said N-termi- 
nai region consists of amino acids 1 through n of a Thermus species DNA polymerase, wherein n is an amino acid 
corresponding to an amino acid m of Thermatoga maritima (Tma) DNA polymerase, SEQ m NO: 10, wherein m is 
between 137 and 291; 

wherein said C-terminal region consists of amino acids m+1 through 893 of Tma DNA polymerase SEQ ID 
NO: 10; 

wherein said N-terminal region is modified by at least one point mutation that substantially reduces or elimi- 
nates 5'-nuclease activity when present in said Thermus species DNA polymerase, or said C-terminal region- 
is modified by at least one point mutation within the region that is amino adds nnl to 291 of Tma DNA 
polymerase that substantially reduces or eliminates 5'-nuclease activity when present in Tma DNA polymer- 
ase; 

wherein said C-terminal region is modified by at least one point mutation that substantially reduces 3' to 5' exo- 

nuclease activity when present in Tma DNA polymerase; and 

wherein said C-terminal region is modified to contain a tyrosine at annino acid 730. 

2. The thermostable DNA polymerase of Claim 1, wherein said N-tenninal region contains a point mutation at an 
amino acid position corresponding to an amino acid in Taq DNA polymerase selected from the group consisting of 
D18. R25, G46. D67, F73, R74, Y81. G107. E117. D119. D120, D142, D144, G187, D188, D191, and G195 
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3. The thermostable DNA polymerase of Claim 1, wherein said C-terminal region contains a point mutation at an 
amino acid position selected from the group consisting of D323, E325, L329, N385, D389, L393, Y464, and D468. 

4. The thermostable DNA polymerase of Claim 2, wherein said N-terminal region contains an aspartic acid at an 
amino acid position corresponding to amino add G46 in Tag DNA polymerase. 

5. The thermostable DNA polymerase of Claim 3, wherein said C-terminal region contains a D323A or E325A muta- 
tion. 

6. The thermostable DNA polymerase of Claim 1 , wherein said Thermus species is selected from the group consist- 
ing of Thermus aquaticus, Thermus flavus, Thermus thermophHus, Thermus species Z05, Thermus caldofilus, 
Thermus species sps1 7, Thermus flliformis. 

7. The thermostable DNA polymerase of Claim 6. wherein said Thermus species is Thermus aquaticus. 

8. The thermostable DNA polymerase of Claim 7, wherein n = 190. 

9. The thermostable DNA polymerase of Claim 8, wherein said N-terminal region contains an G46D mutation, and 
wherein said C-terminal region contains a D323A mutation and a E325A mutation. 

10. An isolated DNA that encodes a thermostable DNA polymerase as claimed in any one of claims 1 to 9. 

1 1 . A plasmid comprising a DNA that encodes a thermostable DNA polymerase as claimed in any one of claims 1 to 9. 

12. An expression vector comprising a DNA that encodes a thermostable DNA polymerase as claimed in any one of 
claims 1 to 9. 

1 3. A host cell transformed with an expression vector comprising a DNA that encodes a thermostable DNA polymerase 
as claimed in any one of claims 1 to 9. 

14. A method for preparing a thermostable DNA polymerase, comprising; 

(a) culturing a host cell transformed with an expression vector comprising a DNA that encodes a thermostable 
DNA polymerase as claimed in any one of claims 1 to 9 under conditions which promote the expression of ther- 
mostable DNA polymerase; and 

(b) isolating thermostable DNA polymerase from said host cell. 

15. A thermostable DNA polymerase prepared by the method as claimed in claim 14. 

16. A method for sequencing a nucleic acid wherein a thermostable DNA polymerase as claimed in any one of claims 
1 to 9 or 15 is used. 

17. Use of a thermostable DNA polymerase as claimed in any one of claims 1 to 9 oris in a nucleic acid amplification 
or sequencing reaction. 

18. A composition comprising a thermostable DNA polymerase as claimed in any on& of claims 1 to 9 or 15 and one 
or more non-ionic polymeric detergents. 

19. A kit for carrying out a primer extension reaction, comprising thermostable DNA polymerase as claimed in any one 
of claims 1 to 9 or 15. 
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